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operative  period. 

?.  Verify  the  reliability  of  several  methods  of  research. 
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4.  Determining  the  efficiency  of  treatment  methods. 
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However,  the  results  of  the  use  of  the  information  search  systems  in  medical 
practice  will  not  be  long  in  being  felt,  anti  even  at  the  very  firs’ 
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FROM  THE  EDITOR 


It  apparently  will  not  be  a  great  exaggeration  to  say  that  the 
development  of  science  in  the  near  future  will  to  a  considerable  degree 
be  determined  by  the  organization  of  the  information  services.  Today  we 
can  speak  of  the  appearance  of  a  new  discipline  -  documentalistics, 
which  concerns  the  questions  of  the  mechanization  of  the  process  of 
collection,  storage,  and  retrieval  of  information.  The  development  of 
this  discipline  may  be  followed  in  the  individual  issues  of  the  ab¬ 
stract  journal  "Scientific  and  Engineering  Information, "  published  by 
VINITI  of  the  Academy  of  Sciences  USSR.  In  1964  this  journal  summarized 
more  than  3000  publications  relating  to  documentalistics.  Search  for 
scientific  Information  Is  a  complex  intellectual  process.  It  Is  based 
on  several  psychological  factors:  intuition,  ability  to  associate 
Ideas,  the  sense  of  satisfaction  from  the  results  of  successful  search, 
etc.  In  an  attempt  to  mechanize  this  process  we  encounter  the  "man-ma¬ 
chine"  problem,  one  of  the  basic  problems  of  cybernetics. 

The  present  volume  is  the  first  collection  of  Soviet  works  on  doc¬ 
umentalistics.  The  papers  selected  for  publication  have  been  presented 
and  discussed  at  seminars  on  documentalistics  of  the  Scientific  Council 
on  Cybernetics  of  the  Academy  of  Sciences  USSR. 
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THE  PROBLEM  OP  DOCUMENTAL  INFORMATION 
G.G.  Vorob'yev 

We  live  in  the  century  of  electronic  machines  and  information 
processing  [1].  The  wide  usage  of  the  term  "information,"  which  we  got 
along  without  quite  easily  up  until  now,  apparently  is  an  indication  of 
the  appearance  of  a  special  problem.  Its  formulation  may  be  expressed 
by  the  words:  the  human  organism  can  no  longer  cope  with  the  informa¬ 
tion  being  received  and  transmitted;  the  creation  of  external  memory 
organs  -  libraries  and  archives  -  provided  for  progress  only  at  a  def¬ 
inite  stage  of  the  development  of  society;  new  revolutionary  transfor¬ 
mations  are  necessary,  whose  importance  will  be  no  less  than  that  of 
the  appearance  of  the  first  books. 

Unfortunately,  we  do  not  really  understand  the  scale  of  this  prob¬ 
lem. 

"The  accumulation  of  human  experience  proceeds  with  amazing  rapid- 
ity,  but  the  means  which  we  utilize  to  struggle  through  the  complex 
labyrinth  to  the  objective  which  is  of  interest  to  us  at  a  given  time 
remain  the  same  as  in  the  time  of  sailing  vessels"  [2,  page  7].  "The 
volume  of  knowledge  accumulated  in  one  form  or  another  appears  to  some 
of  us  in  the  form  of  a  modern  dinosaur.  These  ancient  animals  became 
extinct,  and  in  part  their  disappearance  is  ascribed  to  the  circum¬ 
stance  that  they  were  too  large:  their  brain  could  not  control  the  tre¬ 
mendous  body.  The  volume  of  information  accumulated  by  man  also  may  be¬ 
come  so  great  that  it  is  impossible  to  effectively  utilize  its  individ¬ 
ual  parts"  [3].  In  many  fields  the  situation  is  being  created  when  es- 
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sentially  it  is  easier  to  discover  a  new  fact  or  create  a  new  theory 
than  to  ascertain  that  it  has  not  yet  been  created  or  derived.  "...  The 
problem  is  not  that  we  are  busy  with  excessive  publication  as  a  result 
of  the  scope  and  variety  of  modern  interests,  but  rather  that  the  vol¬ 
ume  of  the  publication  Is  growing  far  beyond  the  limits  of  our  true 
capability  of  really  utilizing  these  data"  [2,  page  23]. 

Books  and  Journals 

Many  indices  of  the  development  of  human  society  grow  exponential¬ 
ly,  i.e.,  they  increase  ten- fold  every  50  years  [4],  An  example  are  the 


Fig.  1.  Exponential  growth  of 
number  of  scientific  Journals. 

1)  Number  of  Journal  titles;  2) 
scientific  Journals;  3)  abstract 
Journals;  *0  years. 

scientific  Journals,  whose  number  has  Increased  by  a  factor  of  10,000 
in  200  years  (1750-1950)  (Fig.  1). 

The  annual  volume  of  printed  information  amounts  to  7  billion  pag¬ 
es  today,  including  2-3  million  scientific  articles  (60  million  pages), 
60,000  book  titles,  200,000  descriptions  for  authors'  certificates  and 
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patents,  4  million  publications  of  an  economic  nature  (advertisements, 
reports,  predictions,  etc.)  [5-8].  Every  year  500  new  Journals  on  chem¬ 
istry  appear  [9]  and  in  the  USA  alone  there  are  25  new  agricultural 
Journals  [10],  During  40  years  the  All-Union  Book  Chamber  has  recorded 
more  than  22  million  items  of  all  forms  of  Soviet  printed  material 
[11]. 

The  traditional  forms  of  books  ar.d  Journals  do  not  answer  the  re¬ 
quirements  of  today.  Let  us  list  the  primary  deficiencies:  the  increas¬ 
ing  difficulty  of  publication  (excess  of  the  author's  supply  over  the 
reader’s  demand),  shortening  of  the  volume  of  scientific  articles  with 
the  loss  of  useful  information,  scattering  of  the  interests  of  the 
readers  among  many  Journals  and  the  associated  decrease  of  the  number 
of  subscribers,  discrepancy  between  the  number  of  readers  and  the  pub¬ 
lication  volume  of  articles,  publication  delay,  increase  of  the  gap  be¬ 
tween  the  obsolescence  of  published  material  and  its  availability,  in¬ 
convenience  of  storage. 

In  spite  of  the  growth  of  the  number  of  Journals,  the  number  of 
potential  authors  is  growing  even  faster.  Today’s  publishing  service 
cannot  cope  with  the  stream  of  manuscripts  and  attempts  to  reduce  this 
stream  mechanically.  For  example,  the  well-known  Journal  "Zavodskaya 
laboratoriya”  [Industrial  Laboratory]  during  20  years  of  existence  has 
somewhat  Increased  its  size  (primarily  by  use  of  internal  techniques  - 
reduction  of  margins,  smaller  type),  has  cut  in  half  the  average  arti¬ 
cle  size,  and  still  is  forced  to  reject  almost  half  the  manuscripts 
solely  for  lack  of  space  [4],  Another  Journal  "Physical  Review"  during 
10  years  (1946-1956)  increased  the  size  of  the  annual  volume  from  1734 
to  734?  pages,  collects  a  payment  of  $25  per  page  from  the  authors,  and 
is  proposing  to  limit  Itself  to  abstracts  of  the  articles  in  the  fu¬ 
ture,  publishing  the  latter  only  on  requests  of  the  readers  [12]. 


Thus,  we  encounter  the  first  contradiction:  there  are  too  many 
journals  and  too  few;  the  editors  in  their  own  way  are  correct  in 
shortening  the  material,  but  at  the  same  time  they  are  doing  irrepara¬ 
ble  damage  to  scientific  and  technical  information.  Let  us  clarify  this 
last  idea  by  an  example.  A  scientist  has  conducted  in  his  laboratory  a 
major  experiment  and  has  published  a  lengthy  article  concerning  it, 
where  he  has  described  not  only  the  experimental  conditions  and  the  re¬ 
sults  obtained  but  also  secondary  phenomena  which  he  has  not  been  able 
to  explain  definitely  because  of  lack  of  facts.  It  is  obvious  that  a 
brief  communication  will  only  be  of  a  publicity  nature,  it  can  neither 
be  utilized  nor  verified  [13]. 

Along  with  the  increase  of  the  number  of  journals  there  is  a  re¬ 
duction  of  the  relative  number  of  subscribers.  The  reason  for  this  is 
not  so  much  the  cost  [14-16],  as  the  specialization  and  scattering  of 
interests  of  the  readers.  On  the  average,  the  fields  of  interest  of  the 
user  of  documental  information  in  1962  were  reduced  by  half  in  compari¬ 
son  with  the  figure  for  8  years  previously,  and  will  again  reduce  by  a 
factor  of  two  by  1970  [3].  In  each  issue  of  a  journal  we  are  interested 
on  the  average  in  two  or  three  articles  which  we  would  like  to  retain, 
but  there  are  tens  or  hundreds  of  such  journals  [17].  In  order  to  sub¬ 
scribe  to  all,  we  would  need  a  lot  of  money,  a  lot  of  available  space, 
and  in  all  cases  we  would  obtain  a  mass  of  ’’waste  paper"  in  which  there 
is  invested  the  labor  of  the  papermakers,  printers,  binders,  and  others 
and  which  will  occupy  space,  interfering  with  us  in  searching  for  t'ne 
required  information.  We  might  add  that  the  readers  of  each  article 
differ  not  only  qualitatively,  but  also  quantitatively:  hundreds  are 
interested  in  one  article,  millions  of  specialists  are  interested  in 
another  article,  but  both  articles  will  be  located  side  by  side  in  an 
issue  of  the  Journal  and  are  thus  presented  to  10,000  subscribers. 
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We  often  pride  ourselves  on  modern  polygraphic  techniques,  but  we 
don't  stop  to  think  that  they  are  capable  of  increasing  the  output  but 
not  the  speed  of  publication.  The  great  Russian  printer  Ivan  Fedorov 
set  and  printed  the  text  of  Evangeliye  uchitel'noye  in  8  months  and  9 
days.  After  400  years  the  11th  volume  of  the  complete  collection  of  the 
works  of  Charles  Dickens,  not  requiring  author's  correction,  required 
8  months  and  20  days  for  printing,  and  the  30 th  volume  required  more 
than  a  year. 

But  even  after  publication  the  article  still  must  follow  a  long 
and  tortuous  path  to  the  person  interested  in  it.  This  applies  particu¬ 
larly  to  periodic  and  nonperiodic  volumes  containing  the  works  of  sci¬ 
entific  research  institutes  and  the  educational  institutions  where  most 
scientific  work  is  concentrated.  Their  circulation  is  small  and  the  ab¬ 
stract  services  are  reluctant  to  acquire  and  process  them  under  the 
pretext  that  in  the  course  of  time  they  will  be  republished  in  the 
leading  journals.  This  may  be  why  the  average  accessibility  of  the  lit¬ 
erature  is  extremely  low:  7555  in  physics  10  years  after  publication, 

55?  in  chemistry,  and  still  less  in  the  other  sciences  (Fig.  2).  We  re¬ 


lished  scientific  and  engineer¬ 
ing  information.  1)  Chemistry; 
2)  physics;  3)  mathematics,  a) 
Literature  accessibility,  ?;  b) 
time  after  publication,  years. 
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call  that  the  USA  industry  became  aware  of  the  development  of  the  So¬ 
viet  turbine  drills  only  6-7  years  after  publication  in  the  Soviet 
press  [16],  But  on  the  average  the  scientific  and  engineering  litera¬ 
ture  is  out  of  date  after  10-15  years  [18]. 

Abstract  Service 

In  the  first  half  of  the  twentieth  century  a  second  form  of  infor¬ 
mation  won  wide  acceptance  -  the  specialized  abstract  volumes  which 
present  a  brief  summary  of  the  contents  of  the  published  scientific  ar¬ 
ticles.  However,  later  on  all  the  deficiencies  of  the  conventional 
journals  were  repeated  here:  delays,  unwieldyness,  expense,  inconven¬ 
ience  in  use,  small  fraction  of  useful  information  (for  the  single  sub¬ 
scriber),  etc. 

The  abstract  journals  quickly  filled  the  shelves  of  the  libraries, 
becoming  a  technical  problem  of  their  own.  One  of  the  largest  informa¬ 
tion  services  ''Chemical  Abstracts1'  from  1907  to  I960  increased  the  num¬ 
ber  of  chemical  journals  abstracted  annually  from  475  to  9800  titles, 
the  number  of  abstracts  to  104,500,  and  the  number  of  chemical  com¬ 
pounds  described  to  160,000  (including  100,000  new  compounds  [9],  14 
information  services  of  the  USA  prepared  in  I960  600,000  abstracts  and 
bibliographic  descriptions,  which  amounts  to  about  10 %  of  the  published 
scientific  information  [19].  In  the  USSR  in  1953  there  was  created  a 
combined  "Abstract  Journal"  on  the  basis  of  the  All-Union  Institute  of 
Scientific  and  Technical  Information  -  VINITI.  By  1962  this  Journal  in¬ 
creased  the  number  of  articles  handled  to  781,000  (from  102  countries 
of  the  world),  still  without  encompassing  everything  either  with  re¬ 
spect  to  branches  of  knowledge  or  with  respect  to  number  of  publica¬ 
tions  . 

Each  year  the  world  spends  200  million  dollars  on  abstract  servic- 
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es  [20],  but  a  very  small  percentage  of  this  sum  has  any  appreciable 
effect.  The  reason  is  that  no  one  has  yet  decided  how  to  accomplish 
complete  coverage  of  the  literature,  whether  we  must  abstract  all  arti¬ 
cles  and  how  many  times  (In  various  aspects)  to  abstract  a  single  arti¬ 
cle  [3].  The  fields  of  Interest  of  the  users  of  documental  Information 
are  not  only  narrowing  (differentiating)  [3],  but  are  also  diffusing 
(Integrating)  [17]  among  many  scientific  and  abstract  journals  and  the 
individual  sections  of  these  journals.  The  article  "Statistical  proc¬ 
essing  of  the  results  of  isotope  measurements  0*  lead  In  meteorites" 
simply  on  the  basis  of  the  title  must  be  applicable  to  the  circle  of 
interests  of  geologists,  astronomers,  physicists,  geophysicists,  geo¬ 
chemists,  chemists,  and  mathematicians. 

The  writing  of  a  universal  abstract  meanB  nearly  complete  repeti¬ 
tion  of  the  entire  article  or  not  writing  anything.  Even  an  article 
which  is  abstracted  in  one  section  of  a  broad-field  Journal  may  be  lost 
for  the  readers  of  other  sections  who  do  not  happen  to  glance  in  the 
neighboring  section,  do  not  have  enough  time  for  this,  or  simply  do  not 
suspect  from  the  specialized  abstract  that  there  is  useful  information 
in  this  article. 

Libraries 

The  situation  in  the  libraries  is  no  less  difficult.  The  total 
number  of  accumulated  printed  works  amounts  today  to  100  million  ti¬ 
tles,  of  which  nearly  50  million  are  books  and  about  10  million  pat¬ 
ents.  The  V.I.  Lenin  State  Order  of  Lenin  Library  of  the  USSR  [Lenin 
Library],  one  of  the  largest  in  the  world,  has  22  million  books,  jour¬ 
nals,  and  bound  volumes  of  newspapers,  which  cccupy  300  km  of  book¬ 
shelves;  and  the  shelf  length  increases  by  15  km  each  year  [11] . 

"I  believe  that  the  majority  of  people  instinctively  do  not  like 


libraries  and  rate  them  somewhere  after  dentists  on  the  basis  of  the 
inconveniences  which  they  cause,”  noted  at  one  time  one  of  the  English 
scientists  [22].  The  huge  number  of  books  and  Journals,  to  which,  un¬ 
fortunately,  they  do  not  always  have  direct  access,  have  a  depressing 
effect  on  the  library  visitor. 

And  the  library  catalogs  are  lots  of  trouble  and  of  little  use. 

The  author  catalog  -  an  intrinsic  part  of  every  library  -  apparently 
satisfies  only  the  lover  of  classical  fiction,  but  is  nearly  completely 
unnecessary  in  a  scientific  library  where  the  reader  is  interested  not 
in  who  wrote  on  a  given  subject,  but  in  what  has  been  written  on  this 
subject.  Another  error  of  the  librarian:  giving  more  attention  to  the 
books  than  to  articles.  As  a  rule,  the  books  are  "in  the  rear"  of  sci¬ 
ence  and  serve  certain  definite  purposes:  as  textbooks,  popular  re¬ 
views  for  expanding  the  horizon,  and  as  handbooks  on  the  desk  of  a  spe¬ 
cialist,  but  those  who  are  at  the  "leading  edge"  need  primarily  the 
scientific  articles. 

The  so-called  systematic  and  subject  catalogs  are  compiled  using  a 
rough  classification  scheme  >n  the  basis  of  book  titles.  To  accomplish 
this  there  is  a  two  or  three-fold  duplication  of  the  library  card 
files.  But  every  article  and  more  especially  books  correspond  not  to 
two  or  three,  but  to  many  dozen  subjects,  and  only  with  a  suitable 
breakdown  do  they  become  really  effective  [17].  According  to  the  calcu¬ 
lations  of  Voigt  [30],  the  users  obtain  i»— 7 %  of  the  scientific  informa¬ 
tion  with  the  aid  of  the  library,  35!*  as  a  result  of  independent  liter¬ 
ature  searches,  and  6-20)*  as  a  result  of  their  own  memory,  leading  them 
to  the  required  original  sources. 

The  production  of  card-catalog  bibliography  in  the  USSR  is  accom¬ 
plished  primarily  by  the  All-Union  Book  Chamber,  which  produces  in  a 
year  several  hundred  thousand  cards  with  a  total  circulation  of  several 
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hundred  million  copies  and  which  operates  18  catalogs  with  9  million 
cards  [11,  23].  The  question  naturally  arises:  "Is  It  advisable  to  cat¬ 
alog  so  extensively  these  materials,  can  the  libraries  utilize  In  their 
card  catalogs  75-80,000  cards  per  year?"  [23].  We  can  glance  In  any  li¬ 
brary  and  see  that  the  majority  of  the  cards  go  Into  the  wastepaper 
basket  and  In  the  best  of  cases  (the  back  side)  Is  used  for  other  pur¬ 
poses. 

The  primary  difficulties  in  the  operation  of  the  libraries  lie  in 
the  absence  of  any  mechanization  [24],  the  limited  funds,  area,  and 
frequently  In  the  reluctance  to  change  a  system  which  has  been  devel¬ 
oped  over  the  years.  Only  these  factors  can  explain  the  understaffing 
of  many  libraries,  the  orientation  of  engineers  and  scientists  toward 
the  specialized  rather  than  universal  (public)  libraries  [25],  and  the 
low  utilization  of  the  materials.  For  example,  more  than  half  tbs  books 
of  the  Lenin  Library  have  not  been  used  by  anyone,  apparently  not  be¬ 
cause  these  books  are  not  required  by  anyone,  simply  the  readers  do  not 
know  of  their  existence. 

Other  Documental  Information  Services 

So  far  we  have  discussed  only  the  so-called  scientific  and  techni¬ 
cal  information,  whose  principal  task  is  the  collection,  processing, 
and  distribution  of  two  opposing  information  streams  -  internal  and  ex¬ 
ternal..  which  include  information  concerning  new  scientific  investiga¬ 
tions,  disclosures,  and  inventions.  The  libraries  combine  the  scientif¬ 
ic  and  technical  information  with  the  archive  and  museum  information, 
where  the  problem  of  record  keeping  is  just  as  important.  Today  the 
state  archives  contain  tens  and  hundreds  of  millions  of  documents.  If 
at  the  present  time  years  are  spent  in  searching  for  documents  and  we 
do  not  always  by  any  means  find  what  we  need  if  because  of  lack  of 
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space  the  older  archives  have  been  destroyed  prematurely,  this  means 
the  tasks  of  the  archive  operation  have  lot  yet  been  resolved.  It  Is 
remarkable  that  with  the  decrease  of  the  probability  of  major  geograph¬ 
ical  discoveries  there  Is  a  continuous  Increase  of  the  probability  of 
no  less  Important  discoveries  in  our  archives  [263.  Construction  draw¬ 
ings,  the  most  Inconvenient  form  of  documentation  (folded  sheets  of 
large  format)  occupy  a  speclil  category  of  archives.  For  the  construc¬ 
tion,  for  example,  of  a  ship  we  require  from  3  to  5000  drawing  sheets 
and  other  documentation  involving  10,000  pages.  These  drawings  are 
stored  by  number  and  are  actually  completely  inaccessible  for  the  work¬ 
er  who  usually  asks  the  question  of  where  and  when  a  similar  design  has 
already  been  developed  or  in  what  construction  a  particular  component 
has  previously  been  used  in  combination  with  some  other  particular  com¬ 
ponent.  In  certain  design  bureaus  the  working  drawings  are  destroyed 
after  a  few  months since  within  the  limits  of  this  time  period  the  so- 
called  design  memory  operates,  permitting  retrieval  of  the  required 
drawing  among  the  other  stored  drawings.  When  drawings  are  accumulated 
over  several  years  such  searches  are  impossible.  As  for  the  museums, 
here  thanks  to  the  efforts  of  scholars,  historians,  archeologists,  and 
regional  scholars  such  a  large  number  of  specimens  has  been  collected 
that  the  majority  cannot  be  located  in  expositions  and  are  stored  so 
that  they  are  not  accessible  for  viewing.  The  question  arises  of  how  to 
create  an  information  system  which,  for  example,  will  enable  each  ar¬ 
cheological  find  to  become  immediately  accessible  to  all  other  sreheol- 
ogists,  and  so  that  the  visitor  to  one  museum  would  have  access  to  the 
exhibits  of  all  other  museums  [27]. 

The  third  form  of  documental  information  Is  economic  (production, 
commercial,  transport,  public  service,  etc.).  The  rapid  growth  of  sci¬ 
ence  and  engineering  entails  an  expansion  of  the  old  Industrial  enter- 
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prises  and  the  appearance  of  new  industrial  giants.  In  these  enterpris¬ 
es  use  is  made  of  hundreds  of  forms  of  machines,  consisting  of  many 
thousands  of  varieties  of  standard  components  and  parts,  and  hundreds 
of  production  items  are  produced.  The  planning  section  of  such  an  en¬ 
terprise  must  carry  out  periodic  reviews  of  the  production  plans  and 
operating  programs,  detanr'ne  the  actual  volume  of  the  product  released 
for  production,  determine  the  required  amount  of  raw  material,  calcu¬ 
late  the  proposed  departmental  loading,  follow  the  progress  of  the 
technological  process  and  the  condition  of  the  machinery,  keep  track  of 
the  operating  documentation,  and  process  statistically  the  data  ob¬ 
tained.  The  activity  of  the  supply  and  sales  divisions  must  be  corre¬ 
lated  precisely  with  this  operation,  without  which  smooth  operation  of 
the  enterprise  is  Impossible. 

The  dense  network  of  commercial  communications,  which  long  ago  ac¬ 
quired  an  international  nature,  creates  at  the  points  of  intersection  - 
reshipment  points  (marine,  river  and  airports,  highway  and  railway 
Junctions,  warehouses  and  storage  areas)  —  accumulations  of  the  most 
varied  cargoes  which  require  accounting  and  distribution.  The  informa¬ 
tion  operations  in  transportation  include  the  recording  of  transit  car¬ 
goes,  integration  of  cargoes,  and  also  the  purely  dispatcher  functions 
-  the  compiling  and  monitoring  of  schedules  with  efficient  utilization 
of  time,  transport  routes  and  facilities.  Under  the  conditions  in  large 
stores  correctly  presented  information  makes  it  possible  to  account  for 
user  demand,  provide  uninterrupted  supply,  and  prevent  the  accumulation 
of  slow-moving  goods  [2]. 

There  is  still  another  form,  normally  termed  business  documenta¬ 
tion,  which  is  closely  allied  with  economic  documentation,  but  has  a 
broader  nature.  It  includes  primarily  correspondence  and  various  admin¬ 
istrative  and  legal  papers:  orders,  instructions,  decisions,  etc.  The 
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old  familiar  expression  "red  tape”  at  first  glance  Is  directed  against 
paper  -  the  information  carrier,  just  as  we  once  heard  the  cry  "Down 
with  the  machine!"  Actually,  at  the  moment  we  must  talk  about  the  de¬ 
struction  not  of  papers  themselves  but  rather  the  old-fashioned  methods 
of  handling  paperwork,  we  must  place  the  operation  of  the  office  on  the 
same  level  as  that  of  the  automated  shop. 

Organization  of  Mental  Work 

The  problem  of  documental  information  is  closely  allied  with  the 
general  problem  of  the  organization  of  mental  work.  The  fact  that  the 
rate  of  increase  of  the  number  cf  managing,  office,  and  scientific  per¬ 
sonnel  exceeds  the  rate  of  increase  of  the  world's  population  [4]  is 
indicative  of  the  unsatisfactory  organization  of  important  spheres  of 
human  activity.  In  essence  all  the  efforts  of  the  recent  centuries  have 
been  directed  toward  increasing  the  productivity  cf  labor,  while  the 
work  of  the  engineer  or  the  scientist  remains  practically  unchanged.  We 
write,  read,  talk,  and  seek  solutions  all  7  hours  of  the  working  day, 
thinking  about  the  decisions  we  have  made  in  the  most  unlikely  sur¬ 
roundings  —  at  a  meeting,  in  the  dining  room,  or  on  the  bus.  Only  a 
single  process  of  mental  labor  has  been  mechanized  to  an  adequate  de¬ 
gree.  This  is  the  process  of  calculating,  which  is  now  accomplished 
with  the  aid  of  adding  machines,  desk-top  electromechanical  computers, 
and  powerful  electronic  units  which  permit,  for  example,  after  12  years 
(19^6-1957)  reduce  the  time  for  calculating  707  decimal  digits  of  the 
number  tt  from  25  minutes  to  25  seconds  [1].  The  time  has  come  for  the 
emancipation  of  mental  labor,  reducing  the  proportion  of  the  subsidiary 
processes  ’-’hich  accompany  the  primary  process  -  thinking. 

At  the  Case  Institute  of  Technology  in  the  USA  the  time  (in  %)  of 
the  scientific  worker  is  distributed  as  follows  [28]: 
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scientific  information  (contact  for 

information  purposes)  33. 4 
business  information  10.4 
thinking  (planning)  6.0 
equipment  setup  6 . 2 
use  of  equipment  (conduct  of  experiment)  23.4 
data  reduction  6.4 
personnel  and  community  affairs  9.8 
various  4 . 4 


The  quantity  33-4%  is,  apparently,  typical,  and  only  chemists,  as 
a  rule,  expend  more  time  on  information  (in  the  strategic  branches  up 
to  94$)  [4],  On  the  average,  the  chemical  engineer  reads  during  his 
life  0.5%  of  these  published  specialized  literature,  and  half  of  this 
quantity  relates  to  information  of  a  general  nature  and  has  no  connec¬ 
tion  with  the  work  he  is  performing  [29]-  A  colorful  description,  re¬ 
lating  equally  well  to  the  Soviet  specialist,  is  found  in  the  article 
of  the  famous  French  doeumentalist  G.  Cordonnier  [17]:  "Many  engineers 
working  in  large  enterprises  and  in  institutions  have  the  habit  of 
keeping  on  their  table  a  pile  of  journals  which  they  never  have  enough 
time  to  read,  meanwhile  these  same  journals  are  awaited  anxiously  by 
other  readers  who  are  less  "high"  and  therefore  are  located  further 
down  the  reading  list.  When  this  pile  becomes  excessively  bulky,  it  is 
taken  care  of  decisively,  glancing  hurriedly  in  one  journal,  leafing 
through  another,  and  as  a  result  reading  nothing."  This  situation  ex¬ 
ists  even  in  spite  of  the  fact  that  far  from  all  results  of  investiga¬ 
tions  and  discoveries  are  written  up  in  the  form  of  articles,  and  tech¬ 
nical  reports,  written  in  a  hurry,  leave  much  to  be  desired  with  regard 
to  quality. 

These  deficiencies  are  reflected  in  the  extremely  low  productivity 
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of  mental  labor  which  In  the  aircraft  and  radio  Industry  of  the  USA  Is 
estimated  at  10*  [31].  Therefore  there  Is  mass  duplication  of  sclentif. 
le  studies  and  engineering  decisions,  as  a  result  of  which  30-85*  of 
the  working  time  Is  Irretrievably  lost  [34],  The  Department  of  Defense 
of  the  USA  alone  expends  1-2  billion  dollars  annually  as  a  result  of 
duplication.  The  principal  causes  of  duplication  are  [31]:  1)  secrecy; 
2)  difficulties  in  disseminating  and  collecting  Information  (Imperfec¬ 
tion  of  the  system);  3)  limitations  in  Information  exchange  as  a  result 
of  competition  and  interdepartmental  barriers;  4)  deficiencies  In  the 
organization  of  the  enterprise  operation  (internal  Information);  5)  Im¬ 
proper  understanding  of  the  problem  (terminological  and  other  difficul¬ 
ties);  6)  governmental  limitations  on  the  exchange  or  Information  be- 
tween  countries. 

Doeumentallstics  -  a  New  Branch  of  Cybernetics 

The  word  "documentation”  has  a  three-fold  meaning.  We  may  under¬ 
stand  by  this  the  process  of  compiling  documents,  the  set  of  documents 
on  a  given  question,  and  a  specific  scientific  discipline.  The  latter 
(Its  synonym  is  documentary  science)  is  devoted  to  the  official  and 
technical  questions  of  operation  with  documents  and  formally  Includes 
such  practical  aspects  as  scientific  and  engineering  information,  ar¬ 
chive  science,  library  science,  patent  affairs,  museum  management,  of- 
flee  clerical  work,  etc. 

The  element  of  documentation  is  the  document  -  fixed  Information 
which  may  be  used  for  consultation,  study,  and  proof.  It  may  be  shown 
that  at  the  present  time  every  fact  is  enclosed  by  clothing  consisting 
of  documents  [32],  whose  forms  become  extremely  varied  -  these  are 
books,  newspapers,  journals,  photographs  and  movie  films,  magnetic  and 
Phonograph  recordings,  catalogs,  advertisements,  reports,  letters,  card 
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catalogs,  museum  and  exhibition  displays.  The  modern  conditions  of  op¬ 
eration  with  documents  require  approaches  which  are  new  in  principle. 
Therefore  the  introduction  of  the  ideas  and  methods  of  cybernetics  into 
documentation  practice  has  led  to  the  creation  of  a  new  scientific  dis¬ 
cipline  -  documentalistics .  Its  basis  is  one  of  the  divisions  of  cyber¬ 
netics  -  the  theory  of  information  systems.  This  trend  is  being  devel¬ 
oped  at  the  Center  for  Studies  in  Documentation  and  Communication  (USA\ 
the  Documentation  Center  of  the  National  Institute  of  Scientific  Re¬ 
search  (Prance),  the  International  Documentation  Center  in  Stockholm, 
the  International  Federation  on  Documentation  (the  former  International 
Inrtit-.’*e  of  Documentation),  the  Documentation  Institute  of  GDR,  the 
VINITI  and  the  Scientific  Council  on  Cybernetics  of  the  Academy  of  Sci¬ 
ences  USSR.  The  investigation  materials  are  published  In  "American  Doc¬ 
umentation,"  "Nachrichten  fuer  Dokumentation,"  "Dokumentation, "  "IRE 
Transactions  on  Engineering  Management,"  "Nauchno-tekhnicheskaya  infor- 
matsiya,"  "Zavodskaya  laboratoriya, "  and  others.  There  is  a  gradual 
recognition  of  the  fact  that  without  cybernetics  -  the  science  of  con¬ 
trol  and  organization  -  the  value  of  machines  in  the  field  of  produc¬ 
tivity  of  mental  work  cannot  in  itself  be  high  [10]. 

A  new  practical  profession  has  appeared  -  documentalist  [ 14 ,  26, 

33 ,  31*],  who  in  contrast  with  the  librarian  or  archivist  is  first  of 
all  a  specialist  in  that  field  of  knowledge  which  the  information  ser¬ 
vice  deals  with,  who  actively  utilizes  such  forms  of  documentation  as 
bound  clippings  and  photocopies,  punched  cards  and  microreproductior.s . 
The  documentalist  is  faced  with  broad  problems,  which  can  be  formulated 
in  10  basic  points: 

1)  furnish  the  user  all  required  information; 

2)  furnish  information  from  adjacent  fields  which  are  of  indirect 
interest ; 
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3)  provide  timely  Information,  i.e.,  not  when  the  need  for  the  in¬ 
formation  has  already  passed  or  has  not  yet  arisen; 

*J)  and  in  the  shortest  possible  form; 

5)  provide  additional  information  without  any  effort  on  the  part 
of  the  user; 

6)  indicate  the  level  of  credibility; 

7)  know  the  source; 

8)  report  possible  delays  in  obtaining  the  information; 

9)  do  not  provide  unnecessary  information  (combat  redundancy); 

10)  aid  the  specialist  in  formulating  his  scientific  and  engineer¬ 
ing  reports  by  extensive  use  of  the  survey  forms  of  documentation. 

The  reorganization  of  the  information  service  in  order  to  satisfy 
these  conditions  is  impossible  without  special  preparation  of  personnel 
and  special  assignments.  During  the  years  of  the  appearance  and  rapid 
development  of  documentalistics  in  the  USA  the  average  amount  assigned 
for  scientific  information  increased  from  2%  (195*0  to  5%  (1959)  and, 
according  to  the  experts,  will  increase  again  by  a  factor  of  2.5  [35, 
36].  The  average  number  of  specialists  per  information  worker  in  1956- 
1958  was  51*  [37]  and  has  a  tendency  to  decrease.  In  the  opinion  of  the 
Soviet  researchers,  the  optimal  level  of  expenditures  on  information 
search  should  amount  to  1/5  of  the  expenditures  on  science  [38]. 

Theory  of  Documental  Information  Systems 

The  information  service  in  the  human  society  is  a  complex  of  sim¬ 
ple  and  complex  information  systems,  each  of  which  includes  such  stages 
of  the  information  process  as  information  collection,  primary  process¬ 
ing,  transmission,  secondary  processing,  storage,  search  and  selection, 
correlation,  duplication,  and  distribution.  With  respect  to  objective 
the  systems  are  divided  into  reference,  current  accounting,  diagnostic, 
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correlation,  educational,  game,  etc.,  to  each  of  which  the  fundamental 
cybernetic  parameters  are  applied  -  operating  speed,  reliability,  adap¬ 
tivity,  life,  compactness,  economy  [3]. 
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Pig.  3-  Overload  of  information  systems.  1)  Region  of  normal  operation; 
2)  overload  region;  3)  failure  region,  a)  Capacity  of  information  sys¬ 
tem;  b)  information  output,  bits/time;  c)  information  input,  bits/time. 

The  systems  are  intended  for  encompassing  information  flows  and 
arrays  which  are  characterized  by  the  capacity  and  dynamics .  Insuffi¬ 
cient  operating  speed  may  lead  to  overloading  and,  as  a  result,  to  re¬ 
duction  of  the  operational  liability  leading  to  temporary  and  complete 
failure  (Pig.  3).  The  other  system  parameters  are  related  in  the  same 
fashion.  For  example,  with  limitation  of  the  mechanical  speeds  we  can 
Increase  the  actual  operating  speed  at  the  expense  of  compactness,  etc. 

The  element  of  documental  information  is  the  document,  which  con¬ 
tains  a  definite  number  of  information  units  (bits).  The  information 
object  in  the  systems,  other  than  documents,  may  be  people,  enterpris¬ 
es,  materials,  equipment,  products,  etc.  Each  of  the  objects  is  charac¬ 
terized  by  a  definite  set  of  information  labels s  among  which  we  identi¬ 
fy  the  so-called  search  labels.  Taken  together,  they  form  the  search 
image  of  the  document,  enabling  us  to  find  rapidly  the  required  docu¬ 
ment  in  a  large  set  of  objects.  Ths  set  of  all  search  labels  of  a  sys¬ 
tem  is  termed  the  search  matrix,  for  which  the  number  of  cells  is  in  a 
direct  relationship  with  the  number  of  objects.  The  term  "density"  de¬ 
notes  the  number  of  cells  per  unit  volume  or  area  of  the  matrix.  The 


matrix  operates  with  a  binary  code,  i.e.,  each  cell  is  assigned  a  defi¬ 
nite  label  and  relative  to  each  object  the  cell  responds  to  an  interro¬ 
gation  by  "yes"  (1)  or  "no”  (0).  With  this  very  simple  arrangement  of 
the  system  (direct  coding),  when  all  the  labels  are  parallel  and  may  be 
.  encountered  in  the  objects  in  all  possible  combinations,  the  informa¬ 

tion  capacity  of  the  object  matrix  and  of  the  system  as  a  whole  is  the 
same.  Mutually  exclusive  labels  within  the  limits  of  a  single  aspect 
permit  the  use  of  combination  coding,  when  the  search  label  is  assigned 
to  a  definite  combination  of  cells;  in  this  case  the  information  capac¬ 
ity  of  the  system  increases  sharply,  while  the  capacity  of  the  object 
matrix  diminishes  correspondingly  (up  to  1).  In  a  demographic  system 
(where  the  information  object  is  the  human  being)  an  example  of  aspects 
with  mutually  exclusive  labels  are  the  surname,  given  name,  patronymic, 
year  of  birth,  nationality,  residence.  With  combination  coding  use  is 
made  of  different  keys  which  facilitate  the  formulation  and  operation 
of  the  system. 

Systems  for  current  accounting  are  characterized  by  the  fact  that 
the  number  of  information  objects  increases  more  rapidly  than  the  num¬ 
ber  of  labels.  In  the  reference  systems  this  increase  is  approximately 
the  same,  and  a  set  of  stereotype  schemes  is  suggested  for  the  search. 
The  diagnostic  systems  (task:  identify  an  unknown  object  with  a  set  of 
known  objects)  are  a  variety  of  the  reference  systems,  with  the  differ¬ 
ence  that  hear  the  search  strategy  is  essentially  a  sequential  task. 

The  correlation  systems  do  not  have  the  objective  of  searching  out  a 
required  object,  but  permit  performing  a  broad  comparison  of  various 
labels  in  order  to  correlate  them. 

Sometimes  the  information  objects  and  the  labels  exchange  places. 
In  this  case  we  speak  of  the  label  search  matrices  and  term  such  sys¬ 
tems  "inverted."  An  example  are  the  comprct  card  catalogs  of  archive 
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materials,  where  the  card  is  entered  not  by  document  but  by  label  and 
fixes  the  number  of  the  corresponding  documents. 

Prom  the  point  of  view  of  mathematics  any  information  system  is  a 
mathematical  table  with  a  certain  number  of  inputs  -  from  one  (single¬ 
aspect  systems)  to  several  tens  and  hundreds.  In  the  case  of  the  sin¬ 
gle-aspect  system  we  have  an  infinitely  long  tape  which  may  be  cut  into 
individual  "frames”  (objects)  and  a  conventional  card  catalog  may  be 
compiled.  The  "frames"  are  arranged  in  a  strict  order  in  the  tape  and 
in  the  card  catalog.  The  only  difference  Is  that  they  cannot  be  removed 
from  the  tape  and,  consequently,  it  is  difficult  to  supplement  and  re¬ 
new  them.  With  the  card  principle  we  can  add  to  the  system  and  select  a 
search  scheme  such  that  we  do  not  need  to  pass  the  entire  array  through 
the  search  device,  but  in  this  case  the  danger  of  disappearance  of  in¬ 
formation  increases  markedly,  so  that  much  time  and  attention  is  re¬ 
quired  not  only  to  find  the  card  but  also  to  put  it  in  its  place;  in 
large  arrays  the  incorrect  return  of  a  card  is  equivalent  to  its  de¬ 
struction.  An  example  of  a  table  with  t*;o  inputs  are  the  conventional 
(traditional)  tables  consisting  of  rows  and  columns,  in  which  finding 
the  required  cells  Is  accomplished  by  the  coordinate  method.  The  design 
of  multi-input  tables  (multi-aspect  systems)  has  some  fundamental  char¬ 
acteristic  features  and  is  based  on  the  fact  that  the  input  function  is 
performed  by  all  the  cells  of  the  table  (matrix).  Thus,  the  card,  or 
the  tape  "frame"  becomes  a  point  of  a  multi-dimensional  information 
apace,  developed  onto  a  plane  [*♦],  and  becomes  a  part  of  the  technical¬ 
ly  realized  information  system.  At  the  present  time  there  are  punch, 
magnetic,  and  optical  methods  of  realizing  such  systems. 

The  study  of  the  comparative  efficiency  of  the  documental  systems 
shows  that  the  difference  between  the  number  of  objects  and  the  search 
labels  must  not  exceed  two  orders.  So  far  schemes  have  been  developed 
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only  for  simple  systems  with  a  number  of  objects  up  to  10,000  and  a 
number  of  labels  up  to  100  or  a  little  more.  The  operation  of  a  system 
with  millions  of  objects  requires  the  use  of  10,000  labels,  operation 
with  which  encounters  difficulties  in  the  classification  aspect.  The 
way  out  of  this  situation  may  be  the  compilation  of  the  so-called  de¬ 
scriptor  glossaries  and  the  development  of  local  decimal  classifica¬ 
tions,  LDK,  which  have  quite  flexible  connection  with  one  another  with¬ 
in  the  framework  of  relatively  large  information  services.  The  use  of 
the  universal  classifications  (of  the  UDK  type)  is  increasingly  consid¬ 
ered  inefficient  [3*0  for  the  following  basic  reasons:  such  classifica¬ 
tions  are  constructed  using  the  differentiated  principle  and  do  not  re¬ 
flect  the  process  of  integration  in  science;  the  new  and  progressive 
branches  of  knowledge  acquire  a  longer  classification  index  than  the 
older,  traditional  branches;  the  processes  of  developing  classifica¬ 
tions  and  their  utilization  are  associated  with  major  difficulties 
which  make  the  information  system  unreliable  and  nonadaptive,  providing 
a  poor  answer  to  today's  problems. 

So  far  we  have  discussed  only  the  search  labels  whose  objective  is 
to  provide  for  the  retrieval  of  the  required  information.  Wn«n  the 
search  is  accomplished  the  system  refers  the  user  to  a  separately  lo¬ 
cated  array  of  documents  arranged  by  number.  This  method  of  search  is 
termed  two-stage.  Sometimes  it  is  economically  advantageous  to  intro¬ 
duce  the  uncoded  information  directly  into  the  search  system  in  order 
that  the  search  operations  deliver  immediately  the  required  documents 
or  their  copies.  In  this  case  the  problems  are  solved  not  only  speedily 
but  also  compactly,  since  there  is  no  need  to  utilize  extensive  librar¬ 
ies  and  archives. 
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Mechanization  and  Automation  of  Information  Systems 

The  tasks  of  Integrated  mechanization  of  Information  systems  in¬ 
clude  operational  search  on  the  basis  of  the  punch,  magnetic,  and  opti¬ 
cal  forms  of  search  matrices,  the  creation  of  magnetic  matrices  with 
high  density  of  code  storage  (large  electronic  computer  memory),  the 
microreproduction  of  documents  for  compact  storage,  fast  duplication 
and  readout,  provision  for  inexpensive  machine  translation  of  text  from 
one  language  to  another,  the  introduction  of  new  methods  for  transmit¬ 
ting  documental  information,  etc. 

The  punch  forms  of  matrices  have  the  lowest  code  storage  density 
and  are  the  cheapest.  Here  the  basis  is  paper  (cardboard)  in  the  form 
of  punched  card  or  punched  tape,  which  have  their  advantages  and  disad¬ 
vantages,  mentioned  above,  and  which  are  quite  widely  used  at  the  pres¬ 
ent  time  [40,  41].  However  the  punch  card  method  is  more  highly  devel¬ 
oped  and,  apparently,  is  receiving  general  recognition  for  use  in  sim¬ 
ple,  nonautomated  systems. 

With  respect  to  method  of  coding  the  perforated  cards  are  divided 
into  punched  and  notched.  In  the  first  case  the  perforation  openings  of 
the  matrix  are  punched  in  the  coding  process;  in  the  second  case  they 
are  made  ahead  of  time  and  are  notched  out  during  coding  (the  notches 
are  completed  to  the  edge  or  are  connected  with  one  another).  With  re¬ 
spect  to  location  of  the  openings  we  differentiate  cards  with  internal 
and  external  (edge)  perforations. 

The  punched  cards  with  internal  perforations  have  been  widely  used 
as  the  so-called  machine  punch  cards  [2,  42-45]  of  standard  format, 

18?. 4  x  82.5  mm,  whose  matrix  consists  of  80  columns  with  10  basic  (0, 
1»  2,  3,  4,  5,  6,  7,  8,  9)  and  2  additional  cells  in  each  column,  which 
constitutes  in  all  960  cells.  In  addition  to  these  80-column  cards,  use 
is  made  (considerably  more  rarely)  of  90-column,  45-column,  etc.  The 
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perforators  used  to  punch  the  required  openings  are  designed  similar  to 
typewriters  and  operate  with  approximately  the  same  speed,  which  as  the 
result  of  a  memory  unit  may  be  Increased  when  duplicating  perforations 
of  the  same  type  [46,  47].  A  specialized  reproduction  device  provides 
an  output  of  100-120  cards/min  [49].  Recently  portable  (pocket)  types 
of  perforators  operating  on  the  principle  of  the  automatic  pencil  [48] 
have  cone  Into  widespread  use.  Sorting  of  the  punched  cards  Is  accom¬ 
plished  at  a  rate  of  25,000-60,000  cards/hr  [46].  A  special  card  selec¬ 
tor  processes  simultaneously  two  arrays  of  cards  at  a  rate  of  250-650 
cards/min  [49],  In  addition  to  these  devices,  the  complex  of  the  per¬ 
forating  machines  include  special  perforators  operating  from  pencil 
marks,  verifiers,  tabulators,  decoders  (60-65  cards/min),  and  card 
counters  (100  cards/min)  [47,  49]. 

The  notched  cards  do  not  require  complex  mechanical  preparation: 
the  working  tools  are  quite  simple  -  scissors  for  cutting  out  openings 
and  knitting  needles  for  selecting  the  required  cards  [50].  The  modest 
technical  equipment,  usually  not  exceeding  in  complexity  a  typewriter  - 
shchiptsy-komposter ,  notching  machines,  electro-vibration  selectors, 
etc.  -  provide  some  time  saving  in  coding  and  multi-aspect  searches, 
which  require  simultaneous  use  of  several  matrix  cells.  The  advantages 
of  the  edge-punched  cards  are:  simplicity  and  clarity  of  operation,  the 
presence  of  a  large  field  for  uncoded  information,  the  possibility  of 
manual  preparation.  The  advantages  of  the  cards  with  internal  perfora¬ 
tions  (they  are  also  termed  slotted  cards  -  from  the  dominating  form  of 
cutout)  are:  somewhat  greater  capacity  of  the  matrix,  and  the  possibil¬ 
ity  of  performing  multi-aspect  search  in  a  single  pass. 

A  special  type  are  the  so-called  transparent,  superposition 
punched  cards,  which  usually  operate  using  the  inverse  principle:  the 
card  is  set  up  on  the  basis  of  a  single  label  and  each  cell  of  the  ma- 
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trix  corresponds  to  a  single  object;  superposing  one  card  on  another 
leads  to  alignment  of  the  openings  corresponding  to  objects  with  the 
given  set  of  labels  [51].  Recently  the  combi.ned  types  of  punched  cards 
have  become  popular:  machine  cards  used  as  transparent  [52,  53] ,  slot¬ 
ted  cards  used  as  transparent  [54],  transparent  with  edge  perforation 
[55,  56]. 

The  diagram  of  the  relationship  of  the  various  types  of  punched 
card  systems  with  respect  to  rumber  of  objects  and  labels  (Fig.  4) 
shows  that  they  compete  very  little  with  one  another  and,  as  a  rule, 
mutually  supplement  one  another.  Cases  are  known  when  the  rapid  growth 
of  the  information  array  led  to  replacement  of  manual  punched  cards  by 
machine  cards.  However,  the  Powers  Regulator  Co.  in  the  USA  considered 
it  advisable  to  go  the  other  way  and  converted  a  bibliographic  refer¬ 
ence  system  operating  electronically  with  80-column  cards  to  cards  with 
edge  perforation  in  order  to  use  more  efficiently  the  uncoded  (ab¬ 
stract)  material;  an  important  conclusion  was  drawn  at  the  same  time 
that  it  is  not  always  nor  in  all  cases  wise  to  use  digital  computers 
and  specialised  servicing  personnel  [57].  The  sorting  speed  for  punched 
cards  with  edge  perforation  is  usually  1/4  to  1/2  the  sorting  speed  of 
the  machine  cards,  and  with  arrays  up  to  100,000  units  the  manual  sort¬ 
ing  may  be  preferable  [58].  Of  considerable  importance  here  is  a  factor 
such  as  the  rate  of  usage  (loading)  of  the  system  [59].  Stanford  Re¬ 
search  Institute  in  the  USA  carried  out  special  experiments  which 
showed  that  for  a  volump  of  the  Information  flow  of  40,000  documents 
per  month  the  manual  punched  cax’ds  are  preferable  if  the  number  of 
searches  per  month  does  not  exceed  250;  with  1000  searches  per  month 
the  machines  Justify  themselves  if  the  volume  of  the  information  flow 
does  not  exceed  7-8000  (Fig.  5). 

To  date  the  optical  forms  of  matrices  do  not  have  great  practical 
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Fig.  4.  Relationship  of  the  various  types  of  punched  card  systems  with 
respect  to  number  of  objects  and  labels.  1)  Machine  punched  cards;  2) 
transparent  punched  cards;  3)  punched  cards  with  edge  and  internal  per¬ 
forations;  4)  transparent  punched  cards  with  edge  perforations,  a)  Ob¬ 
jects;  b)  labels. 

importance.  They  are  combinations  of  black  (nontransparent)  and  white 
(transparent)  points  on  a  photographic  surface  occupying  half  or  one- 
third  the  area  of  microcards.  In  the  case  of  microfilms  the  matrix  is 
photographed  along  the  edges  (in  the  manner  of  a  sound  track)  or  in  the 
spaces  between  the  frames  of  the  microtext.  The  advantages  of  the  opti¬ 
cal  method  are:  high  density  of  code  storage  and  long  life  of  the  coded 
information.  The  drawbacks  are:  relative  expense  and  the  technical  dif¬ 
ficulties  associated  v.'ith  coding  and  recoding.  It  appears  that  these 

shortcomings  may  be  avoided  by  the  method  of  thermoplastic  storage, 

2 

which  provides  a  density  of  6.2  million  cells/cm  [3J. 

An  example  of  a  fast  but  very  expensive  system  is  Minicard,  where 
a  16  x  32  mm  microcard  contains  the  image  of  12  pages  of  text  and  ma¬ 
trices  with  2730  cells.  Documentation  consisting  of  11  million  pages  is 
stored  on  900,000  cards.  The  system  operating  speed  is  1800  cards/min. 

Microfilm  devices  with  revolving  endless  tape  (of  the  type  of  Rap¬ 
id  Access  Look-Up,  Poisk,  etc.)  have  limited  application  because  of  the 
inadequate  number  of  information  objects  and  the  inadequate  volume  of 
the  matrix,  and  also  because  of  the  difficulties  associated  with  addi- 
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Fig.  5-  Economics  of  machine  and  manual  (punch  cards  with  edge  perfora¬ 
tion)  information  systems.  1)  Operational  costs,  millions  of  dollars; 

2)  manual  punch  cards;  3)  number  of  searches  per  month;  lJ)  volume  of 
information  flow,  number  of  documents  per  month. 


tion  to  and  renewal  of  the  documental  information.  As  a  rule  they  are 
used  in  commerce  and  reproduce  commercial  reference  materials,  provid¬ 
ing  the  capability  of  finding  the  required  page  in  1.5  seconds. 

The  magnetic  method  has  begun  to  play  a  major  role  in  the  elec¬ 
tronic  information- logic  machines  and  has  far  from  exhausted  its  capa¬ 
bilities,  particularly  with  relation  to  magnetic  cards  which  contain, 
like  the  optical  cards,  a  microimage  of  the  pages  of  the  documents  and 
a  magnetic  matrix  (along  the  card  edges  or  in  a  block  at  one  side  —  in 
the  first  case  the  microimage  is  read  in  transmitted  light,  in  the  sec¬ 
ond  case  it  is  read  in  reflected  light).  The  advantages  of  this  method 
are:  comparatively  low  cost,  very  high  recording  density,  and  the  pos¬ 
sibility  of  systematic  rewriting  of  the  code.  As  a  result  of  the  pri¬ 
mary  shortcoming  -  unreliability  of  the  storage  -  the  electronic  infor¬ 
mation-logic  systems  using  the  magnetic  form  of  memory  are  forced  to 
store  the  primary  information  material  on  machine  cards  which  systemat¬ 
ically  copy  the  object  matrices  onto  the  magnetic  base. 

As  an  example  of  a  system  using  the  magnetic  method,  we  mention 
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Magnavue  (the  firm  Magnavox).  One-third  the  area  of  the  25-4  *  76.2  mm 
card  is  occupied  by  a  microphotocopy  of  the  text,  the  remaining  part  of 
the  card  is  occupied  by  a  matrix  of  capacity  3000  cells.  The  minimal 
service  life  of  the  card  is  200,000  scannings.  The  information  input 
rate  into  the  computer  is  90,000  svmbols/sec  [3]. 

The  struggle  for  high  speed  and  large  memory  of  the  information 
systems  is  not  limited  to  the  methods  just  mentioned.  In  the  USA  in  the 
near  future  information  stores  of  capacity  10-100  billion  cells  will  be 
built,  and  the  cost  per  cell  will  be  reduced  to  5  cents  using  magnetic 
drums  and  discs  [3].  Realistic  conditions  are  being  created  for  storing 
with  the  aid  of  electrical  impulses  on  a  thin  metallic  film  coating  a 
glass  plate  of  dimensions  13  *  18  cm  the  equivalent  of  the  information 
stored  by  the  human  memory  [50].  We  predict  a  great  future  for  the 
crystal  optics  forms  of  memory,  where  the  information  cells  are  the 
colored  defects  of  the  crystal  lattice. 

Along  with  these  directions,  problems  are  being  resolved  of  reduc¬ 
ing  (reduction  of  size  without  alteration  of  form)  uncoded  information. 
Today's  microphotoreproduction  techniques  permit  obtaining  efficiently 
microcopies  of  documents  reduced  by  a  factor  of  10-20,  permits  reading 
them  with  the  aid  of  desk-top  and  pocket  readers,  and  permits  making 
any  number  of  copies  of  normal  dimensions  [ 6 0—63 ] -  The  cost  of  the  mi¬ 
crocards  is  from  5  to  50%  of  the  cost  of  the  books  which  they  repro¬ 
duce  [1-4  3.  The  microcards  carry  a  magnetic  or  optical  form  of  search 
matrices  [33  or  are  combined  with  punched  cards  (aperture  and  klyasser- 
nyy  cards)  [2,  34,  64—68] ,  which  are,  therefore,  an  element  of  rela¬ 
tively  optimal  (cybernetic)  information  systems.  The  Verac  system  with 
the  aid  of  140-fold  reduction  permits  reproducing  on  25.4  x  25.4  cm 
film  10,000  pages  of  text;  the  information  stored  on  200  such  films  Is 
searched  in  2  seconds  and  then  is  transmitted  over  television,  coded  on 
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paper  in  normal  scale  or  on  photofilm  with  a  standard  reduction  [3]. 

The  Walnut  system  (IBM  firm)  in  2  passes  performs  a  reduction  to  1/1000 
of  the  natural  size.  In  each  of  100  modules  there  is  stored  200  plastic 
containers  with  50  micrccards.  The  search  is  performed  using  punched 
card,  punched  tape,  or  magnetic  methods  [3].  In  principle  it  is  possi¬ 
ble  (but  not  yet  technically  accomplished)  to  reproduce  on  a  sphere  the 
size  of  a  pin  head  the  text  which  is  equivalent  in  volume  to  2 4  volumes 
of  the  Encyclopedia  Britannica.  The  required  information  will  be  found 
by  the  coordinate  method  and  read  with  the  aid  of  an  electron  micro¬ 
scope  . 

Thus,  all  the  books  in  the  world  may  be  concentrated  in  the  volume 
of  a  pocket  notebook.  This  tendency  to  bring  the  information  near  the 
user  is  of  great  importance  in  selecting  a  technical  policy  in  the 
field  of  documental  information. 

Organization  of  the  Information  Services 

The  ideas  of  documentalistics  find  their  reflection  (elementary 
for  the  moment)  in  the  practice  of  the  information  services.  Library 
[69-71]  and  archive  [ 7 2—73 3 »  including  drawings  [3],  materials,  current 
newspaper  and  journal  information  [7^-75]  are  being  translated  onto  mi¬ 
crofilm.  The  world's  first  scientific  journal  on  microfilm,  bypassing 
the  conventional  method  of  printing,  has  begun  publication  and  its  pri¬ 
mary  advantages  are:  low  cost,  quick  publication,  retention  of  articles 
of  large  volume,  improved  quality  of  illustrations,  convenience  of 
storage  [76].  A  basically  new  form  of  publication  is  the  so-called 
electronic  book  -  "Computexts"  -  a  magnetic  tape  suitable  for  input  in¬ 
to  a  high-speed  digital  computer.  The  first  such  issue  is  entitled 
"Electronic  Index  of  Chemical  Patents"  (150,000  descriptions  covering 
17  years,  volume  0.028  m^)  [77].  The  American  Society  of  Chemical  En- 
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gineers  has  made  it  mandatory  to  provide  the  articles  published  in  its 
organs  with  descriptor  abstracts  which  are  suitable  for  entry  into  any 
information  system,  and  has  issued  in  two  volumes  a  specialized  de¬ 
scriptor  dictionary  (the  source)  on  chemistry  and  chemical  engineering 
(50,000  terms)  [78].  An  example  of  efficient  organization  of  library 
servicing  is  the  storing  of  documents  on  aperture  punched  cards  and 
their  transmission  over  specialized  television  channels  within  a  radius 
of  several  kilometers.  The  client  contacts  the  book  depository  by  tele¬ 
phone  and  at  an  agreed  time  switches  on  the  television,  equipped  with 
an  automatic  focusing  system  and  adaptors  for  turning  the  pages,  chang¬ 
ing  the  image  area  and  the  magnification  ratio  [79].  The  highly  auto¬ 
mated  electronic  systems  are  still  of  an  experimental  nature.  They  in¬ 
clude  the  information  service  of  the  Central  Information  Agency  of  the 
USA,  which  operates  on  the  Walnut  system  and  within  10  seconds  finds 
any  document  from  100  million  stored  documents  [80].  The  electronic  ma¬ 
chine  Sinoptikon  with  a  printer  providing  600  lines/min  compiles  a  bib¬ 
liography  on  a  given  subject  and  writes  out  citations  to  questions  — 
"family,"  "man,"  "god,"  "punishment,"  and  others  from  the  books  of  71* 
authors  and  the  Bible  [8l]. 

The  experience  accumulated  makes  it  possible  to  draw  certain  pre¬ 
liminary  conclusions.  First,  the  selection  of  limited-mechanization  or 
highly  mechanized  (automated)  systems  is  determined  primarily  by  two 
factors:  the  intensity  of  system  usage  and  the  nature  of  the  informa¬ 
tion  requirement.  In  purely  reference  systems,  which  produce  finished 
results,  the  level  of  mechanization  is  dictated  only  by  the  intensity 
of  utilization:  an  expensive  system  which  services  a  client  in  one  sec¬ 
ond  will  be  preferable  to  a  cheap  system  which  does  the  same  in  one 
hour,  if  we  must  satisfy  hundreds  of  requests  per  day.  In  bibliographic 
and  other  such  systems  which  provide  data  in  semi-finished  form  for 
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further  individual  processing,  limited  mechanization  is  required  (at 
each  writing  table)  or  a  combined  system  which  includes  limited  mechan¬ 
ization  in  the  last  stage. 

As  an  example  we  cite  the  optimal  organization  of  a  system  for 
medical  diagnostics  and  correlation  of  medical  (clinical)  experience. 
Each  practicing  doctor  maintains  his  own  personal  manual  punch-card 
catalog  of  disease  histories.  The  medical  information  center  collects 
duplicates  of  these  cards,  reproduces  them  and  sends  them  on  a  sub¬ 
scription  basis  to  all  interested  individuals  and  organizations,  who 
enter  them  into  their  own  narrowly  specialized  information  systems.  In 
turn  the  machine  duplicates  of  the  same  cards  are  input  to  a  single 
reference  automated  system  whose  primary  purpose  is  to  establish  prece¬ 
dents  and  select  data  from  them,  the  development  of  optimal  methods  cf 
diagnostics,  prophylactics,  and  treatment  of  diseases. 

For  the  first  time  VINITI  in  1963  began  to  publish  the  abstract 
Journal  "Metallurgiya"  on  manual  punched  cards,  which  permits  a  sharp 
increase  of  the  number  of  Individual  subscribers  to  the  abstract  vol¬ 
umes,  increases  the  profitability  and  efficiency  of  these  volumes.  Met¬ 
allurgiya  appears  in  23  individual  card  catalogs,  coded,  in  addition  to 
the  library-bibliographic  labels  (author,  publication  type,  branch, 
country,  card  file  number,  and  year),  by  100  basic  (constructed  using 
the  LDK)  and  **0  additional  subject  labels  (headings),  3000  headings  in 
all.  The  specialized  information  centers,  obtaining  information  in  the 
form  of  such  punched  cards,  recode  the  information  (without  altering 
the  form  of  the  stencil)  in  accordance  with  its  local  decimal  classifi¬ 
cation  (10-100  card  files  with  100  or  1^0  headings  in  each),  circulate 
the  information  according  to  the  requests  of  their  users  (laboratories, 
factories  and  individual  specialists),  and  send  the  cards  to  locations 
where  they  are  recoded  a  third  time  and  introduced  into  the  specialized 
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local  information  systems.  One  of  the  ten  card  files  on  general  ques¬ 
tions  of  a  given  field  is  used  directly  in  the  center  itself  for  the 
local  3rd  stage  system.  The  reference  service  of  this  center  involves 
the  coordination  of  the  operations  of  all  the  highly  specialized  sys¬ 
tems  and  getting  them  to  form  regular  reference  functions  on  request 
through  the  center. 

This  three-stage  information  system  [82]  facilitates  the  work  of 
the  information  services  by  separating  the  functions  of  information 
collection  and  processing,  replaces  the  tedious  universal  classifica¬ 
tions  by  adaptive  local  schemes  which  are  consonant  with  the  structure 
of  today's  economy,  permits  better  utilization  of  the  information  mate¬ 
rials  . 

The  issuance  of  secondary  information  on  punched  cards  may  be  con¬ 
sidered  as  the  operation  of  merging  two  information  processes:  ab¬ 
stracting  and  searching.  The  next  step  is  che  merging  of  the  primary 
and  secondary  information  so  that  the  issuance  of  each  publication 
would  simultaneously  input  it  into  the  existing  search  systems  [3^]. 
This  task  may  be  performed  by  punched  cards  or  other  new  forms  of  cy¬ 
bernetic  technology. 
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MICRO PHOTOREPRODUCTION  METHODS  OP 
REDUCING  DOCUMENTAL  INFORMATION 

S . A.  Lipkin 

Microphotocopying  —  the  production  of  reduced  ohotographic  copies 
of  documents  —  enables  the  solution  of  several  fundamental  problems 
concerning  the  mechanization  of  information  work.  Microphotocopying  is 
used  as  a  basis  for  providing  compact  storage,  creating  conditions 
suitable  for  mechanization  and  automation  of  information  search,  and 
solving  the  question  of  efficient  copying  and  duplication  of  documents. 
All  this  permits  a  new  approach  to  the  problems  of  publishing  and  li¬ 
brary  functions,  creating  more  flexible  and  efficient  systems  for 
bringing  information  to  the  reader. 

At  the  present  time  in  the  USSR  there  are  in  serial  production 
several  models  of  equipment  and  specialized  photographic  films  for  mi¬ 
crophotocopying,  which  provide  the  possibility  of  the  practical  intro¬ 
duction  of  this  process  in  many  fields  of  work  with  documents:  scien¬ 
tific  and  engineering  information,  design  drawing  documentation,  ar¬ 
chive  management,  etc. 

However,  in  the  interest  of  progress  of  this  very  promising  field 
of  technology  a  considerable  expansion  of  the  output  of  equipment  and 
photographic  materials  is  necessary. 

Modern  microreproduction  technology  is  characterized  by  a  tendency 
toward  obtaining  as  great  a  reduction  of  the  document  size  as  possible, 
large-scale  use  of  microphotocopying,  the  construction  of  systems  for 
mechanizing  and  automating  information  search  based  on  microphotocopies 
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of  documents,  the  design  of  facilities  for  efficient  copying  and  dupli¬ 
cation  of  documents. 

I.  REDUCTION  OP  DOCUMENT  DIMENSIONS  (INFORMATION  CONTRACTION) 

The  size,  volume,  and  correspondingly  the  weight  of  documents  are 

* 

reduced  during  microphotocopying  approximately  inversely  proportional 
to  the  square  of  the  reduction  factor  of  their  linear  dimensions.  If  a 
document  of  dimension  20  x  30  cm  is  reduced  by  microphotocopying  only 
by  a  factor  of  10  (which  is  elementary  with  the  modern  technical  capa¬ 
bilities),  then  its  dimensions  on  the  microphotocopy  become  2x3  cm, 
i.e.,  the  area  and  approximately  the  volume  are  reduced  by  a  factor  of 


100. 


Let  us  denote:  R  is  the  reduction  factor  of  the  linear  dimensions 
of  the  documents;  V m  is  the  volume  of  the  microphotocopies;  V ^  Is  the 
volume  of  the  documents;  A  is  a  coefficient  of  proportionality  which 
takes  into  account  the  relationship  of  the  thicknesses  of  the  document 
paper  and  the  photographic  film,  the  nature  of  the  folding  (presence  of 
air  spaces),  etc.  Then  the  equation  which  expresses  the  relation  be¬ 
tween  the  volume  of  the  documents  and  the  microphotocopies  takes  the 
form: 


V* 


The  same  relation  will  hold  between  the  weight  of  the  documents 
and  their  photocopies,  the  area  of  the  documents  S ^  and  of  the  micro 
photocopies  Sm,  i.e.: 


as  in  this  case  k  ■  1.  |  ' 

The  effect  which  may  be  achieved  by  reducing  the  dimensions  of  j 

documents  by  microphotocopying  is  obvious:  quite  realistic  technologi-  | 
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cal  capabilities  are  created  for  relieving  the  document  concentration 
centers  of  the  paper  storage  facilities  without  reducing  at  the  same 
time  the  completeness  of  the  information  which  these  centers  contain. 

It  should  be  noted  that  in  microphotocopying  not  only  is  an  abso¬ 
lutely  exact  content  of  the  documents  retained,  but  all  their  graphical 
peculiarities,  which  in  certain  cases  is  very  important. 

At  the  present  time  we  use  in  practice  a  reduction  of  up  to  20 
times  in  microphotocopying.  There  are  capabilities  (photographic  mate¬ 
rials  and  equipment  have  been  fabricated)  for  reductions  of  up  to  40 
times.  In  several  systems  for  automating  information  search  which  have 
been  developed  in  the  USA  use  is  made  of  photocopying  with  a  reduction 
of  the  original  documents  by  60,  100,  and  even  200  times  [1],  With  a 
reduction  of  200  times  the  microimage  of  the  page  will  have  dimensions 
of  approximately  1  *  1.5  mm,  and  2625  microimages  fit  on  a  microcard  of 
format  75  x  125  mm  (dimension  of  a  standard  bibliographic  card). 

By  using  high  density  of  document  spacing,  we  can  very  efficiently 
solve  the  problem  of  rapid  search  without  resorting  to  complex  devices. 
If  a  large  group  of  documents,  which  may  be  selected  on  a  common 
theme,  are  arranged  on  a  single  microcard,  then  the  search  problem  re¬ 
duces  to  finding  the  coordinates  of  the  site  of  the  required  document 
on  the  microcard. 

The  problem  of  achieving  high  reduction  factors  is  associated  with 
the  development  of  photographic  materials  and  optics  with  high  resolv¬ 
ing  powers.  Between  the  resolving  power  N  of  the  photographic  system 
"lens-photographic  material,"  used  for  microphotocopying,  the  reduction 
ratio  /?_,  and  the  minimal  line  width  on  the  document  d  there  exist  the 
following  relation: 

fi>2  N' 


-  43  - 


r 


i.e.,  the  line  width  on  the  microphotocopy  d/R  must  be  equal  to  or 
greater  than  the  minimal  width  of  the  line  which  the  photographic  sys¬ 
tem  is  capable  of  transmitting.  The  width  of  the  lines  of  the  various 
typographic  fonts  usually  varies  in  the  limits  from  0.1  to  0.25  mm. 

The  selection  of  the  optimal  resolving  power  of  the  photographic 
systems  "lens-photographic  material"  is  illustrated  by  the  following 
example:  for  a  minimal  width  of  a  line  on  the  document  equal  to  0.15 
mm,  when  preparing  microphotocopies  with  a  20-fold  reduction  it  is  nec¬ 
essary  that  the  resolving  power  (without  account  for  unavoidable  loss¬ 
es)  be  greater  than  70  lines/mm  (from  the  preceding  relation 

We  have  been  able,  using  the  new  Mikrat-positive  photographic  film 
with  resolving  power  470  lines/mm,  produced  by  the  Kazan  branch  of 
NIKFI ,  using  Soviet-made  optics,  to  obtain  quite  clear  images  of  typo¬ 
graphic  text  (line  width  0.15  mm)  with  a  reduction  greater  than  60- 
fold. 


II.  THE  TECHNOLOGY  OF  LARGE-SCALE  MICROPHOTOCOPYING 


Current  microreproduction  technology  is  based  on  highly  productive 
equipment  permitting  the  transfer  of  large  arrays  of  documents  to  mi¬ 
crophotocopies  in  a  short  time.  T’ne  microphotocopying  of  documents  is 
usually  performed  in  two  stages:  first  microphotocopy  negatives  are 
prepared,  from  which  positive  copies  are  then  printed.  The  negatives 
serve  for  the  printing  of  the  required  number  of  positives,  which  are 
used  as  the  working  material  for  retrieval  and  reading  of  the  docu¬ 
ments.  If  necessary  the  exposed  photographic  film  may  be  processed  us¬ 
ing  the  so-called  reverse  process,  permitting  the  production  of  a  posi¬ 
tive  image  without  subsequent  printing  on  positive  photographic  film. 
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The  reverse  process  of  handling  the  photographic  film  is  considerably 
more  complex  than  the  common  negative-positive  process,  and  the  princi¬ 
pal  difficulty  here  is  the  reproduction  of  the  microphotocopies,  which, 
as  a  rule,  is  required. 

With  the  availability  of  high-quality  micrcphctocopies  the  storage 
of  the  original  documents  is  unnecessary,  since  the  complete  and  exact 
content  of  the  documents  always  may  be  reproduced  from  the  microphoto¬ 
copies  . 

1.  Photographic  Materials  for  Microphotocopying 

As  the  photographic  materials  for  microphotocopying  we  use  the 
specialized  so-called  Mikrat  photographic  films  produced  by  the  Soviet 
movie  film  industry,  primarily  at  the  V.V.  Kuybyshev  Chemical  Plant  in 
Kazan. 

All  types  of  Mikrat  films  have  a  noncombustible  (triacetate)  base. 
Prior  to  1962  a  single  type  of  negative  Mikrat  photographic  film 
Mikrat-200  was  produced  having  a  resolving  power  of  200  lines/mm.  Seri¬ 
al  production  of  the  Mikrat-300  film  began  in  1962.  At  the  end  of  1963 
experimental  lots  of  two  new  Mikrat  photographic  films  -  Mikrat-300B 
and  Mlkrat-positive  became  available. 

In  contrast  with  Mlkrat-300,  the  Mikrat-300B  has  increased  light 
sensitivity  and  a  dark  antihalo  layer.  This  photographic  film  is  in¬ 
tended  for  micro  filming  equipment  for  dynamic  pictures. 

The  Mikrat-positive  photographic  film  is  used  for  contact  printing 
of  positive  microphotocopies.  It  is  also  produced  with  a  special  anti- 
curl  layer,  providing  flatness  of  the  photographic  film,  which  is  ex¬ 
tremely  important  in  the  preparation  of  microcards  (format  pieces  of 
photographic  film  with  mlcrophotocopies  of  the  documents  arranged  on 
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them) . 

The  following  are  the  characteristics  of  certain  types  of  Mikrat 
photographic  films. 


Characteristics  of  Mikrat  Films 
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1)  Type  of  photographic  film;  2)  producer;  3)  production  date;  4)  emul¬ 
sion  number;  5)  characteristic  of  photographic  film;  6)  light  sensitiv¬ 
ity,  units  per  GOST;  7)  contrast;  8)  resolving  power,  lines/mm;  9)  max¬ 
imal  density;  10)  emulsion  layer  thickness,  microns;  11)  Mikrat;  12) 
plant;  13)  June;  1^)  Kuybyshev  Chemical  Plant  in  Kazan;  15)  July;  16) 
same;  17)  December;  18)  Mikrat-positive. 
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The  Soviet  Mikrat  photographic  films  are  equivalent  in  quality  to 


the  best  foreign  photographic  films  for  the  same  purpose. 

Dlazo  film.  Ir.  contrast  with  the  Mikrat  photographic  films  de¬ 
scribed  above  which  contain  in  the  emulsion  silver  halide  which  is  sen¬ 
sitive  to  light,  the  light  sensitivity  of  the  diazo  films  is  the  result 
of  complex  organic  substances  -  diazo  compounds,  which  decompose  under 
the  action  of  light  [2,  33*  The  diazo  films  have  very  low  light  sensi¬ 
tivity  in  the  ultraviolet  portion  of  the  spectrum,  therefore,  they  may 
be  processed  in  a  light  room.  The  diazo  films  have  high  resolving  pow¬ 
er,  exceeding  the  resolving  power  of  the  best  Mikrat  photographic  films 
and  have  high  contrast. 

Wet  processes,  associated  with  subsequent  long-duration  drying, 
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are  not  required  for  developing  the  dlazo  films.  Their  cost  is  signifi¬ 
cantly  lower  than  the  cost  of  the  silver  halide  films.  These  character¬ 
istics  make  the  diazo  films  very  promising  for  preparing  positive  mi¬ 
crophotocopies.  This  method  may  be  used  for  efficient  duplication  of 
documents  directly  at  the  storage  location. 

Kalvar  photographic  film.  The  light  sensitivity  of  the  Kalvar  pho¬ 
tographic  film  is  also  the  result  of  diazo  compound  content  [4],  The 
nitrogen  atoms  which  are  released  during  their  decomposition  under  the 
action  of  light  remain  within  the  layer;  after  a  slight  short-term 
heating  the  layer  softens  and  the  nitrogen  bubbles  depart,  leaving  a 
trace  (cavity).  The  light  refraction  coefficients  are  different  on  the 
segments  of  the  layer  where  the  cavities  have  been  formed  and  where 
they  have  not.  Depending  on  the  nature  of  the  illumination  a  negative 
or  positive  image  is  obtained.  The  resolving  power  of  the  Kalvar  photo¬ 
graphic  film  is  1000  lines/mm. 

Photochromatlc  materials.  The  photochromatic  materials.  The  photo- 
chromatic  materials  permit  obtaining  microphotocopies  with  very  large 
reduction  ratios.  In  comparison  with  the  silver  halides,  the  photochro¬ 
matic  materials  have  several  important  advantages:  there  is  no  graini¬ 
ness  at  all,  a  very  high  reduction  ratio  is  provided,  very  high  resolv¬ 
ing  power,  the  capability  of  transferring  from  ‘'mall  to  very  high  den¬ 
sities,  the  image  becomes  visible  immediately  after  exposure  and  does 
not  require  development,  the  resulting  image  is  "reversible"  -  it  may 
be  easily  removed  and  again  impressed  on  the  same  segment  of  the  photo¬ 
graphic  layer. 

The  production  of  the  Image  on  the  photochromatlc  materials  is 
based  on  the  property  of  these  materials  to  alter  their  color  when  il¬ 
luminated  by  light  of  the  correspond! ng  portion  of  the  spectrum;  if 
then  the  resulting  image  Is  again  illuminated  by  light,  of  a  different 
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portion  of  the  spectrum  or  the  light  sensitive  layer  is  heated  slightly 
the  image  disappears.  The  reversibility  of  the  process,  the  removal  and 
repeated  exposures  on  the  same  segment  of  the  layer,  is  based  on  this 
property.  Along  with  these  valuable  qualities  the  photochromatic  mate¬ 
rials  have  an  essential  shortcoming  -  poor  retention  of  the  image:  un¬ 
der  room  temperature  conditions  the  image  is  retained  only  a  few  hours, 
and  at  low  temperature  a  few  months  or  even  a  year. 

2.  Equipment  and  Technology  for  Mass  Production  of  Roll  Mlcrophoto- 
coples 

The  technological  process  of  the  production  of  microphotocopies  is 
made  up  of  the  operations  of  photographing  the  documents,  development, 
inspection  and  mounting  of  the  negatives,  printing  and  development,  in¬ 
spection  and  mounting  of  the  positives. 

Photographing  of  the  documents  is  performed  on  specialized  micro¬ 
filming  equipment.  There  are  two  types  of  such  equipment:  static  film¬ 
ing,  when  the  filming  is  accomplished  with  a  stationary  original  (docu¬ 
ment)  and  stationary  photomaterial,  and  dynamic  (slit)  filming,  accom¬ 
plished  with  continuous  and  synchronous  motion  of  the  original  and  the 
photographic  material. 

Figure  1  shows  a  general  view  of  the  UDM-2  static  type  microfilm¬ 
ing  apparatus  produced  in  series  by  the  Odessa  Plant  of  KINAP.  The  doc¬ 
uments  in  the  form  of  individual  sheets  or  bound  in  a  book  are  placed 
under  the  glass  on  the  illuminated  screen  1,  the  plate  2  presses  the 
surface  of  the  pages  against  the  glass.  The  light  from  the  lamp  3  is 
reflected  from  the  surface  of  the  page  being  photographed,  passes 
through  the  lens  of  the  camera  and  strikes  the  light-sensitive  layer 
of  the  photographic  film. 

The  UDM-2  unit  uses  roll  perforated  or  unperforated  photographic 
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film  of  either  16  or  35  mm  width.  The  camera  magazine  is  designed  to 
hold  up  to  30  m  of  photographic  film.  The  reduction  ratio  during  micro¬ 
reproduction  may  be  varied  smoothly  from  5  to  20.  Focusing  during 
change  of  the  reduction  ratio  is  accomplished  automatically.  The  maxi¬ 
mum  format  of  the  document  being  photographed  is  407  *  576  mm.  The  set¬ 
up  Is  adapted  for  semiautomatic  operation  (the  operator  simply  turns 
the  pages);  the  productivity  depends  on  the  nature  of  the  documents  be¬ 
ing  photographed  and  is  approximately  2500  pages  per  shift. 
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Fig.  1.  UDM-2  microfilming  set¬ 
up.  1)  Screen;  2)  small  table 
for  books;  3)  illuminating 
lamps;  4)  camera. 


Among  other  microfilming  units  of  the  static  type  we  might  name 
the  Dokumator  (models  IV  and  V)  produced  in  GDR  by  the  nationalized 
firm  Karl  Zeiss.  This  unit  is  designed  to  use  35  mm  roll  movie  film. 

The  maximal  reduction  ratio  is  26;  the  maximal  format  of  the  document 
being  photographed  is  8l4  *  1152  mm. 

The  Soviet  microfilm  unit  RUST-3  i8  produced  for  microphotocopying 
of  drawings  and  other  documents  of  large  format  (to  8l4  x  n^>2  nun).  The 
photographing  id  accomplished  on  roll  urperforated  photographic  film  of 
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70  or  35  mm  width.  The  maximal  reduction  ratio  is  12.8. 

The  microfilming  equipment  for  dynamic  filming  differ  externally 
from  the  microfilming  units  for  static  filming.  They  have  no  screen,  no 
external  illumination,  they  are  considerably  smaller  in  size. 

The  principle  of  dynamic,  or  slit,  filming  consists  in  the  illumi-  ‘ 
nation  and  photographing  at  each  moment  of  a  narrow  strip  (slit)  on  the 
original,  which  moves  continuously  with  various  segments  of  the  surface 
falling  in  the  illuminated  field.  The  light  reflected  from  the  original 
passes  through  the  lens  onto  the  light-sensitive  layer  of  photographic 
material,  which  moves  synchronously  with  the  original;  the  photographic 
film  moves  slower  by  a  factor  equal  to  the  reduction  factor. 

The  equipment  for  dynamic  filming,  having  smaller  dimensions  and 
requiring  considerab?.y  less  electrical  power,  provide  a  productivity 
several  times  that  of  the  static  filming  units.  The  dynamic  units  may 
be  used  to  photograph  originals  in  the  form  of  individual  sheets  and 
the  length  of  the  photographed  original  is  unlimited. 

The  Moscow  Design  Bureau  for  Cine  Equipment,  MKBK,  has  developed 
three  types  of  dynamic  filming  equipment.  The  NM-1  unit  operates  as 
follows.  The  stack  of  documents  is  placed  on  the  self-feeder,  which 
feeds  them  one  after  another  over  a  transporter  the  filming  slit. 

The  rate  of  movement  of  the  documents  (apparatus  productivity)  is  0.6 
m/sec,  i.e.,  in  one  hour  up  to  9000  sheets  of  length  200  mm  may  be  pho¬ 
tographed.  The  maximal  width  of  the  photographed  document  is  300  mm, 
the  length  is  unlimited. 

Text  located  on  both  sides  of  a  sheet  may  be  photographed  in  a 
single  pass  of  the  document.  The  photography  is  accomplished  on  a  sin¬ 
gle  or  simultaneously  on  two  roll  photographic  films  of  width  up  to  16 
mm.  The  reduction  ratio  is  24,  32,  and  43. 

The  NM-2  microfilming  unit  (Pig.  2)  is  intended  for  photographing 
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on  roll  photographic  film  of  width  35  or  70  mm  engineering  drawing 
originals  and  other  large-format  documents.  The  width  of  the  photo¬ 
graphed  original  may  be  up  to  850  mm.  Filming  may  be  accomplished  in 
reflected  or  transmitted  (for  transparent  originals)  illumination.  The 
reduction  ratio  using  photographic  film  of  70  mm  width  is  10  or  13,  for 
35  mm  film  the  reduction  ratios  are  19  or  26.  The  rate  of  travel  of  the 
document  through  the  photographic  slit  (unit  productivity)  is  0.3  m/ 
sec. 
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Fig.  2.  NM-2  microfilming  unit. 


Figure  3  shows  the  NK-3  portable  microfilming  equipment.  It  occu¬ 
pies  an  area  of  270  x  1J10  mm  and  has  a  height  or  330  mm.  The  width  cf 
the  photographed  documents  (in  the  form  of  individual  sheets  or  small 
notepads)  is  up  to  300  mm,  the  length  is  unlimited.  The  unit  Is 
equipped  for  photographing  in  reflected  or  transmitted  (tracing  paper) 
light.  The  document  rate  of  travel  is  0.3  m/sec.  The  photographic  film 
width  is  16  or  35  mm;  the  reduction  ratio  Is  0  and  20. 

Still  another  unit  for  dynamic  microfilming,  AShM-1300,  has  ap¬ 
peared  recently,  intended  for  microphotocopying  of  documents  of  very 
large  format  -  width  to  1300  mm  with  unlimited  length. 

After  filming  the  photographic  filr  is  subjected  to  chemical  proc- 
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easing  in  solutions,  washing,  and  drying.  The  Mikrat  photographic  films 
are  processed  in  ordinary  concentrated  developing  solutions  for  black 
and  white  photographic  materials.  The  developing  time  is  ^-8  min;  after 
developing  the  film  is  rinsed  in  water  or  immediately  placed  in  a  fix- 
ing  solution  and  later  washed.  Very  careful  washing  is  required  in  or¬ 
der  to  provide  storage  for  several  years,  in  order  to  practically  com¬ 
pletely  'remove  the  remains  of  the  fixing  salt.  Washing  of  the  photo¬ 
graphic  film  continues  for  about  10  minutes  in  tap  water.  After  this 
the  film  is  dried  in  room  conditions  or  in  a  drying  box  using  slightly 
heated  air. 
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Pig.  3*  NM-3  microfilming  equip¬ 
ment  , 

The  chemical  processing  of  the  roll  photographic  film  is  performed 
in  special  devices  or  developing  machines  which  are  adapted  for  provid¬ 
ing  uniform  contact  of  the  entire  surface  of  the  light-sensitive  layer 
with  the  processing  solutions  and  the  wash  water.  Devices  with  spiral 
winding  of  the  photographic  film  may  be  used  for  this  purpose;  the  spi¬ 
ral  coll  with  the  photographic  film  is  submerged  in  a  tank  with  the  so¬ 
lution,  where  the  processing  solutions  enter  t’  ^aps  between  the  spi¬ 
ral  turns.  The  Karacharov  Plastics  Plant  produces  such  a  tank,  designed 
for  processing  30-meter  lengths  of  35-  or  16-mm  photographic  film. 
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The  manual  developing  devices  are  advisable  for  use  when  it  Is 
necessary  to  process  small  quantities  of  photographic  film.  For  large 
quantities  (more  than  300  m  per  working  day)  special  developing  ma¬ 
chines  are  used.  In  the  latter  the  roll  photographic  film  passes  con¬ 
tinuously  through  all  the  stages  of  the  chemical  processing:  develop¬ 
ment,  fixing,  washing,  and  drying.  The  developed  and  dried  photographic 
film  is  gradually  wound  Into  a  roll.  The  Leningrad  Cine  Equipment  Plant 
Leninkap  produces  the  6CP-4  developer  intended  for  chemical  processing 
and  drying  of  16  and  35  mm  perforated  or  unperforated  Mikrat  photo¬ 
graphic  film.  The  productivity  of  this  machine  (rate  of  photographic 
film  travel)  is  23,  47,  93  or  185  m/hr.  The  productivity  is  determined 
by  the  necessary  duration  of  stay  of  the  photographic  film  in  the  de¬ 
veloping  solutions.  For  photographic  film  of  the  Mikrat  type  the  devel¬ 
oping  lasts  4-8  min„  which  corresponds  to  film  travel  rates  through  the 
60P-4  developer  of  47-93  m/hr.  After  the  chemical  processing  the  micro¬ 
films  are  subjected  to  careful  visual  inspection  in  order  to  eliminate 
possible  mechanical  damage  (scratches)  and  other  defects  which  can  in- 
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fluence  the  readability  of  the  reproduced  documents. 

Any  desired  number  of  roll  positive  copies  may  be  printed  by  the 
contact  method  (without  changing  the  Image  scales)  from  the  resulting 
roll  negatives  of  the  document  microphotocopies.  Special  high-produc¬ 
tion  copying  devices  are  used  for  this  purpose. 

Soviet  industry  produces  three  types  of  copiers  for  printing  ml- 
crophotocopie3  on  roll  photographic  film:  MKP-1,  MKT-?,  and  MKP-3.  The 
MKP-1  and  MKP-2  copiers  differ  In  that  the  first  is  intended  for  print¬ 
ing  on  roll  photographic  film  of  width  16  and  3:>  nan,  while  the  second 
is  for  35  and  70  mm. 

Figure  4  shows  a  general  view  of  the  MKT-1  copier-.  The  printing 
rate  on  this  unit  Is  150  m/hr.  The  KKF-3  copier  is  a  very  portable  de- 
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vice  (Pig.  5),  intended  for  printing  on  36  and  35-mm  photographic  film. 
The  printing  rate  is  50  m/hr.  The  AKD-55  mass-produced  copier  (Pig.  6) 
may  be  used  for  printing  individual  frames  with  a  microimage.  It  is 
recommended  for  selective  printing  of  individual  frames  of  dimension 
2H  x  36  and  18  x  24  mm  from  35-ml llimeter  roll  negative  onto  35-milli¬ 
meter  positive  roll  photographic  film.  The  unit  is  very  portable,  has 
dimensions  of  235  x  195  x  172  mm;  a  darkroom  is  not  required  for  opera¬ 
tion. 

Soviet  industry  produces  photographic  films  (see  above)  and  all 
the  necessary  equipment  for  large-scale  preparation  of  microphotocopies 
on  roll  photographic  film  with  reductions  to  20  times  and  more.  Micro¬ 
photocopies  on  roll  photographic  material,  the  so-called  microfilms, 
have  found  wide  application  in  this  country.  The  technological  process 
for  their  production  is  usually  termed  microfilming.  In  many  cases  this 
process  is  based  on  the  highly  developed  cinematographic  experience; 
this  obviously  explains  the  wide  usage  of  microphotocopying  in  the  roll 
version.  However,  the  arrangement  of  the  microphotocopies  on  rolls  has 
several  essential  drawbacks,  which  complicate  the  use  of  the  micropho¬ 
tocopies  both  for  manual  operation  and  for  creating  automatic  systems 
for  information  retrieval.  It  is  difficult  to  work  with  the  strip  in 
roll  form,  special  devices  and  time  are  required  for  loading  the  roll 
into  the  viewing  device  (for  reading).  If  the  required  material  is  lo¬ 
cated  at  the  end  of  the  roll  it  is  necessary  to  reroll  the  reel,  which 
is  time  consuming  and  requires  the  use  of  specialised  equipment. 

It  is  Impossible  to  design  a  flexible,  efficiently  expandable 
system  for  information  retrieval  and  output  (it  is  impossible  to  intro¬ 
duce  new  microphotocopies  without  cutting  the  roll)  on  the  basis  of  the 
roll  arrangement  of  the  microphotocopies. 

These  drawbacks  resulting  from  the  arrangement  of  the  microphoto- 
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copies  on  a  roll  are  completely  eliminated  with  arrangement  of  the  mi¬ 
crophotocopies  on  individual  segments  of  photographic  material  -  micro¬ 
cards  . 


Thus,  the  reduced  photographic  copies  of  documents  -  microphoto¬ 
copies  are  divided  into  two  forms:  microfilms  and  microcards. 

3.  Equipment  and  Technology  for  Mass  Production  of  Microcards 

Microcards  have  not  yet  found  wide  practical  application  in  the 
Soviet  Union,  therefore  the  name  itself  is  somewhat  unusual.  3y  a  ml- 
crocard  we  mean  a  card  fabricated  from  photographic  material  or  printed 
by  a  polygraphic  method,  whose  surface  has  the  image  of  microphotocop¬ 
ies  of  documents.  The  microcard  may  be  fabricated  on  a  transparent 
(transparent  film)  or  nontransparent  base  (white  paper) i  in  the  latter 
case  it  is  possible  to  arrange  the  microphotocopies  on  both  sides  of 
the  card. 

Outside  the  Soviet  Union  it  is  usual  to  use  the  term  miorooard  for 
microcopies  arranged  on  cards  made  from  photographic  paper  (on  a  non¬ 
transparent  base).  The  exact  same  cards,  made  from  photographic  film 
(on  a  transparent  base),  are  termed  micro fioha ,  while  those  fabricated 
from  paper  with  reduced  images  of  the  documents  printed  by  a  polygraph¬ 
ic  method  are  termed  mioroprinta  [5]* 

In  our  translated  literature  we  encounter  the  terra  diamiorooard , 
i.e.,  microcards  on  a  transparent  base  which  may  be  read  in  transmitted 
light,  and  apimiorooarda ,  microcards  on  a  nontransparent  base  which  are 
read  in  reflected  light. 

The  simplest  is  the  microcard  with  a  single  row  arrangement  of  the 
document  microcopies.  Such  microcards  may  be  fabricated  by  cutting  the 
roll  microfilm  into  Individual  pieces  corresponding  to  the  length  of 
the  microcard.  Considerably  more  complex  is  the  preparation  of  micro- 


cards  with  arrangement  of  the  document  pages  in  several  rows;  with  this 
arrangement  the  microcard  takes  on  the  form  of  the  conventional  card 
catalog  card.  The  number  of  pages  (frames)  of  the  documents  and  rows  on 
the  microcard  depends  on  the  size  of  the  originals,  the  reduction  ra¬ 
tio,  and  the  microcard  format. 

As  a  rule,  the  microcopies  of  documents  must  be  arranged  on  the 
microcards  Just  as  the  words  and  lines  cf  a  manuscript  or  printed  text 
are  arranged,  from  left  to  right  and  from  top  to  bottom. 

The  most  widely  used  microcard  dimension  is  75  x  125  mm  (Fig.  7)  - 
the  format  of  the  standard  bibliographic  card.  Les3  widely  used  is  the 
105  x  1*J8  mm  format.  In  the  systems  for  mechanized  information  search 
which  use  as  the  information  carrier  nicrophotocopies  of  documents, 
quite  different  formats  are  permitted,  depending  on  the  conditions  of 
system  operation. 

In  principle,  the  same  photographic  materials  may  be  used  for  pre¬ 
paring  the  microcards  as  were  used  for  preparing  roll  microphotocopies 
with  the  single  essential  difference  that  the  photographic  film  must 
not  be  curled,  i.e.,  it  always  remains  flat,  regardless  of  temperature 
oscillations  and  humidity  changes  of  the  surrounding  air.  Considering 
that  the  photographic  film  is  a  system  of  se.eral  layers  with  different 
physical  and  mechanical  properties,  the  satisfaction  of  this  require¬ 
ment  is  related  with  increased  complication  of  the  fabrication  process 
for  the  phovograpnic  film  itself.  At  the  present  time  the  back  side  of 
the  photographic  film  opposite  the  light-sensitive  layer  is  coated  with 
the  same  thickness  of  an  anticurl  layer,  which  leads  to  a  general 
thickening  of  the  material.  Development  is  continuing  cf  a  special  lac¬ 
quer  coating  for  the  photographic  film  to  ensure  it.  flatness. 

Three  technological  processes  are  known  for  fei  icatlng  microcards 
with  multi-row  arrangement  of  the  document  microcop  ,*s. 
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Fig.  7.  Microcard  (multiple-frame  microcard). 

1.  The  roll  negative  of  the  document  microphotocopies  is  cut  into 
individual  strips  corresponding  to  the  length  of  the  rows  on  the  micro- 
card.  Then  these  strips  are  arranged  on  a  glass  as  they  are  to  be  posi¬ 
tioned  on  the  microcard  copy  and  from  this  montage  contact  prints  are 
made  of  the  required  number  oJ  microcards.  This  method  is  quite  accept¬ 
able  if  we  are  not  required  to  retain  the  negative,  which  must  be  cut 
into  individual  pieces.  This  method  essentially  does  not  require  any 
special  equipment  and  is  available  to  any  photographic  lab<.  tory  which 
has  equipment  for  contact  printing.  The  montage  of  the  microcards  from 
the  negative  film  strips  is  a  very  simple  manual  operation. 

3.  The  microfilming  of  the  documents  is  performed  with  multi-row 
arrangement  of  the  documents.  For  this  we  use  a  special  wide  photo¬ 
graphic  film  corresponding  to  the  width  of  the  microcards.  After  expo¬ 
sure  the  film  is  processed  by  the  usual  method,  and  the  required  number 
of  microcards  are  printed  from  the  resulting  negatives  oy  the  contact 
method.  Thus,  the  multi-row  arrangement  of  the  microphotocopies  of  the 
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documents  Is  accomplished  here  in  the  photographing  process.  For  this 
purpose  we  require  specialized  microfilming  equipment  for  static  film¬ 
ing  with  stepped  transverse  movement  of  the  photographic  film  for  pho¬ 
tographing  each  subsequent  row  on  the  microcard.  Such  equipment  has 
comparatively  low  productivity,  particularly  in  comparison  with  the  mi¬ 
crofilming  equipment  for  dynamic  filming.  For  mass  production  of  micro- 
cards  using  this  method  it  would  be  necessary  to  build  specialized  mi¬ 
crofilming  equipment,  since  the  equipment  which  we  produce  with  single¬ 
row  arrangement  of  the  microcopies  of  the  documents  cannot  be  utilized. 

3.  This  method  is  of  most  interest,  since  it  is  based  on  the  ex¬ 
isting  high-productivity  microfilming  equipment  with  single-row  ar¬ 
rangement  of  the  document  microphotocopies.  While  in  the  second  method 
the  multi-row  arrangement  of  the  document  microphotocopies  was  accom¬ 
plished  during  the  negative  process  (during  microfilming),  here,  during 
the  positive  process  a  contact  print  is  made  from  the  roll  negative  of 
the  microcopies  onto  a  wide  positive  photographic  film  (corresponding 
to  the  width  or  length  of  the  microcards)  so  that  in  each  row  of  the 
microcard  there  is  printed  the  corresponding  strip  (piece)  of  the  roll 
negative. 

Thus,  in  place  of  tha  equipment  for  stepped  microfilming  here  we 
require  equipment  for  stepped  multi-row  contact  printing,  which  is 
simpler  than  the  equipment  for  stepped  filming. 

It  must  be  emphasized  that  in  this  method  we  use  the  roil  negative 
prepared  on  the  conventional,  widely  U3ed  microfilming  equipment.  From 
the  roll  negative  we  can  print  microcar^s  or  roll  positive  copies  - 
micro films . 

A  prototype  of  a  manual  copier  for  printing  microcards  from  roll 
microfilms  has  been  built  at  VINTTI  (Fig.  8).  The  equipment  permits 
printing  microcards  on  roll  photographic  film  or  photographic  paper  of 
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75-mm  width.  Printing  is  performed  from  roll  negatives  onto  perforated 
or  unperforated  photographic  film  of  16  or  35-mm  width. 

The  positive  photographic  film  passes  through  the  contact  table  in 
the  chamber  1.  The  negative  photographic  film  is  wound  on  the  reel  2, 
passes  under  the  glass  which  presses  the  two  films  together  at  the  mo¬ 
ment  of  printing.  After  printing  each  successive  row  the  chamber  is 
shifted  by  the  width  of  one  row  and  the  negative  photographic  film  is 
shifted  for  printing  the  succeeding  microphotocopies.  The  prototype  of 
the  manual  copier  enables  the  printing  of  the  order  of  100  microcards 
per  ’.forking  day. 

A  prototype  of  a  copier  for  printing  microcards  from  roll  micro¬ 
films  has  also  been  fabricated  at  NIKFI.  The  equipment  is  designed  for 
printing  from  35-mm  perforated  photographic  illm  onto  roll  photographic 
material  (film  or  paper).  The  Moscow  Design  Bureau  for  Cine  Equipment, 
MKBK,  following  the  specifications  of  VINITI  and  NIKFI  has  developed  a 
semiautomatic  copier  for  printing  microcards  from  roll  microfilms  (Pig. 
9).  The  productivity  of  the  unit  is  of  the  order  of  200  four-row  micro¬ 
cards  per  hour.  Tne  entire  row  being  printed  is  viewed  on  the  screen  1 
with  a  5-fold  magnification,  or  individual  microcopies  are  viewed  with 
a  25-fold  magnification.  The  equipment  provides  for  selective  printing, 
i.e.,  printing  individual  frames  from  the  negative. 

The  chemical  processing  of  wide  photographic  film  with  microcards 
printed  from  the  roll  negatives  is  accomplished  just  as  the  usual  roll 
photofilm  with  positive  microphotocopies.  Equipment  is  being  developed 
at  VINITI  for  developing  and  drying  roll  photographic  films  with  micro¬ 
cards  and  for  cutting  the  roll  photographic  film  irto  individual  micro- 
cards. 

Very  promising  for  the  efficient  and  economical  production  of  cop¬ 
ies  from  microcards  la  the  use  of  dlazo  film,  on  which  copies  may  ^e 
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Pig.  8.  Prototype  of  copier  for 
printing  microcards  from  roll 
microfilms. 


Pig.  9*  The  AKPM  (developed  by 
MKBK)  copier  for  printing  micro 
cards  from  roil  microfilms. 
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made  from  microcards  directly  at  the  place  of  storage  immediately  after 
selecting  the  required  microcard. 

In  addition  to  the  microcards  on  photographic  material  which  per¬ 
mits  producing  microcopies  with  large  magnifications,  of  interest  are 
the  materials  printed  by  the  typographic  method  with  some  reduction  of 
the  text  dimensions  -  up  to  5  times. 

In  addition  to  the  size  reduction,  such  microeditions  are  signifi¬ 
cantly  cheaper  than  the  conventional  typographic  editions  (the  cost  is 
reduced  approximately  in  proportion  to  the  square  of  the  reduction  ra¬ 
tio).  Special  optical  devices  are  required  for  reading  the  reduced 
text.  Microeditions  are  particularly  advantageous  for  long  runs. 

III.  MECHANIZATION  AND  AUTOMATION  OF  INFORMATION  SEARCH 

It  is  difficult  to  imagine  a  system  for  mechanizing  information 
search  with  the  traditional  storage  of  paper  documents  on  shelves  and 
racks,  when  they  are  dispersed  in  space.  It  is  a  different  matter  if 
huge  masses  of  documents  are  compressed  into  a  small  volume  and  the  re¬ 
quired  document  is  automatically  selected  from  an  array,  made  available 
for  reading,  or  a  copy  is  made  from  it.  Microphotocopying  has  created 
the  primary  basis  for  the  mechanization  and  automation  of  Information 
search  -  the  reduction  of  information  into  a  comparatively  small  vol¬ 
ume.  The  mechanization  and  automation  system  based  on  document  micro- 
photocopies  usually  contains  two  elements:  document  microphotocopies  on 
a  roll  (on  a  tape)  with  some  sequential  arrangement  of  the  materials, 
or  in  Individual  segments  of  photographic  material  (microcards),  and 
the  search  labels  for  the  cox'responding  documents. 

The  search  labels  may  characterize  the  document  content  with  vary¬ 
ing  degree  of  detail.  In  the  simple  cases  these  are  brief  bibliographic 
summaries,  while  in  the  complex  cases  they  are  a  detailed  description 
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of  the  document  which  provide  for  accomplishing  a  logical  search.  The 
search  labels  are  arranged  alongside  the  document  microphotocopies  or 
are  located  separately  from  them. 

The  first  case  is  advantageous  when  the  selection  is  mad*  from  a 
large  number  of  documents  with  respect  to  a  comparatively  small  number 
of  labels.  The  possibility  is  created  for  performing  a  single-step 
search  —  the  desired  document  is  selected  simultaneously  with  the 
search  labels. 

With  a  large  number  of  search  labels  for  a  comparatively  small 
volume  of  documents  it  is  advisable  that  the  search  labels  be  separate 
from  the  documents.  Then  the  search  is  accomplished  in  two  stages: 
first  we  find  the  storage  place  of  the  documents  (address)  from  the 
search  labels,  after  which  the  document  itself  is  selected. 

Three  methods  of  writing  the  search  labels  are  known:  perforation, 
photographic,  and  magnetic.  The  perforation  writing  of  the  search 
labels  is  accomplished  by  punching  in  a  definite  combination  openings 
over  the  entire  field  or  along  the  edges  of  paper  cards  or  tape.  The 
document  microcopies  are  mounted  in  a  window  in  the  paper  cards;  such 
c ■>mbined  punch  cards  with  microcards  are  usually  termed  aperture  punch 
cards.  The  photographic  writing  of  the  search  labels  is  a  combination 
of  dark  and  transparent  segments  on  photographic  material.  Very  high 
code  density  can  be  achieved  by  this  method  (see,  for  example.  Fig. 

18).  The  magnetic  recording  of  the  search  labels  is  accomplished  in  a 
magnetic  layer  by  combining  magnetized  and  nonmagnet  1  zed  segments  of  a 
tape  or  card.  The  magnetic  layer  may  be  arrsnged  alongside  the  photo¬ 
graphic  strip  where  the  document  text  is  recorded.  A  special  photomag- 
netic  material  on  which  magnetic  tracks  are  located  beside  the  photo¬ 
graphic  layer  is  being  developed  at  NIKFI . 

In  principle  it  is  possible  to  combine  the  rhet-- -graph le  and  mag- 


netlc  recording  of  the  code,  of  which  the  first  carries  invariable 
(constant)  labels,  and  the  second  carries  variable  labels. 

Search  systems  are  known  which  use  roll  microphotocopies  arranged 
on  individual  segments  of  photographic  film  (microcards).  The  roll  ar¬ 
rangement  has  the  following  significant  drawbacks:  a)  the  impossibility 
of  adding  or  subtracting  from  the  roll  any  portion  of  the  material 
(without  cutting  the  roll  and  subsequent  splicing),  i.e.,  the  impossi¬ 
bility  of  obtaining  a  flexible,  adaptive  system;  b)  in  order  to  accom¬ 
plish  sequential  search  with  the  soft  material  located  at  the  end  of 
the  roll,  it  is  necessary  to  rewind  the  entire  roll;  c)  the  search  rate 
(rate  of  movement  of  the  photographic  film  tape)  is  limited  by  the  fact 
that  during  rewinding  of  the  roll  there  are  two  variable  masses  -  one 
roll  diminishes  (from  which  the  film  is  unwound),  the  other  grows. 

These  drawbacks  are  completely  eliminated  when  using  microcards, 
which  are  easily  added  to  as  new  material  arrives,  unnecessary  micro¬ 
cards  are  taken  out,  during  search  there  is  no  need  for  looking  sequen¬ 
tially  through  the  entire  array  of  material;  since  the  individual  mi¬ 
crocards  have  very  little  weight,  inertialess  systems  may  be  created 
for  their  transportation,  providing  for  adequate  speed  of  operation. 

However,  th*  roll  version  has  certain  advantages:  the  possibility 
of  losing  individual  document  pages  is  eliminated;  the  roll  is  more 
easily  subjected  to  mechanized  sterilization  processing  which  is  re¬ 
quired  for  long-term  storage  of  the  microphotocopies. 

The  systems  for  information  storage  and  search  based  on  micropho¬ 
tocopies  of  the  documents  differ  in  the  method  of  recording  the  code  of 
the  search  labels  (perforation,  photographic,  and  magnetic);  arrange¬ 
ment  of  the  search  label  code  separate  from  the  document  mlcrophotocop- 
les  (two-step  search)  or  together  with  the  document  microphotocopies 
(single-step  search);  arrangement  of  the  microphotocopies  on  roll  pho- 
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tographic  material  or  on  Individual  segments  of  the  photographic  mate¬ 
rial  -  microcards. 

1.  Manual  Search 

Roll  negative  and  roll  positive.  The  microphotocopies  of  the  docu¬ 
ments  are  arranged  on  roll  photographic  film  of  16,  35,  or  70  mm  width. 
Most  widely  used  is  the  roll  film  of  35-mm  width.  The  negative  provides 
for  long-term  storage  of  documents,  and  from  the  negative  there  are 
printed  out  positive  copies  which  serve  for  reading  the  documents  peri¬ 
odically  as  they  are  needed.  The  roll  length  is  determined  by  the  na¬ 
ture  of  the  documents  and  the  frequency  cf  referral  to  them.  In  the 
case  of  low  loading  of  the  information  system  (archive  materials)  the 
document  microcopies  may  be  grouped  in  large  rolls  of  up  to  300  meters 
-  roughly  more  than  15,000  pages  with  a  reduction  ratio  of  15. 

The  most  common  roll  lengths  are  30  meters  for  the  negatives  and 
3-4  meters  for  the  positives.  The  rolls  with  positive  microphotocopies 
are  stored  in  wooden  cabinets  with  sliding  drawers  (Fig.  10)  which  are 
equipped  with  cells  in  the  form  of  a  honeycomb.  Metal  or  plastic  boxes 
are  also  used  for  storing  the  rolls.  The  positive  copies  may  be  stored 
in  polyethylene  bags  or  without  any  packaging  if  dust  is  kept  away  from 
th  i.i.  The  rolls  are  numbered,  arranged  in  proper  order,  and  indexed  us¬ 
ing  a  card  file. 

This  system  for  raicrophotocopy  storage  and  retrieval  has  been  in¬ 
stalled  in  the  Lenin  Library,  the  Main  Archive  Administration,  Gldro- 
metslushba  and  other  installations. 

Roll  negative  and  positive  on  single-row  microcards.  The  roll  neg¬ 
ative  remains  in  the  form  described  in  the  preceding  section,  while  the 
roll  photographic  film  with  positive  microphotocopies  of  the  documents 
is  cut  into  individual  segments  -  microcards  of  established  length.  The 
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microcard  permits  the  inclusion  of  several  microcopies  of  documents  on 
the  same  theme  and  is  always  equipped  with  a  heading  frame  (Fig.  11) 
including,  for  example,  a  microcopy  of  the  title  page  of  a  journal,  a 
code  or  working  number  which  is  used  to  code  the  article.  The  latter 
are  made  large  enough  so  that  they  are  visible  on  the  film  without  mag¬ 
nification.  The  microcards  are  stored  by  stacks  and  are  collected  into 
card  files. 


Fig.  10.  Cabinet  for  storing 
microfilms . 

In  the  GDR  (Dokumator  system)  albums  are  used  on  the  sheets  of 
which  pockets  are  made  for  the  microcards. 

In  comparison  with  the  roll  systems  the  microcard  systems  have  the 
following  advantages:  a)  it  is  more  convenient  to  find  all  the  required 
material  in  the  card  file;  b)  it  is  easier  to  load  the  microcards  into 
the  equipment  for  reading  the  microcopies;  c)  only  the  required  docu¬ 
ment  is  located  on  the  microcard.  If  a  single  article  is  located  on 
each  microcart*  tack  of  microcards  with  the  annual  set  of  articles 

of  a  journal  will  have  a  thickness  of  tb*  order  of  50  mm  (assuming  that 
each  number  of  the  Journal  has  25  articles). 


Negative  and  positive  on  single-row  gilcrocards.  In  contrast  with 
the  system  described  In  the  preceding  section.  In  this  case  the  roll 
film  with  the  negative  microphotocopies  Is  also  cut  Into  Individual 
pieces.  This  makes  It  easier  to  select  materials  for  printing  the  posl 
tlves;  It  is  difficult  to  accomplish  selective  contact  printing  from 
the  roll  negative  (on  the  existing  copying  equipment).  At  the  same  time 
the  location  of  the  negative  microphotocopies  on  the  segments  increases 
considerably  the  possibility  of  their  loss. 

Roll  negative  and  positive  on  microcards  with  multi-row  arrange¬ 
ment  of  document  microcopies.  In  contrast  with  the  simple  microcards 
with  a  single-row  arrangement  of  the  microphotocopies,  here  the  latter 
are  arranged  on  the  microcards  themselves  in  several  rows  from  left  to 
right  and  from  top  to  bottom.  This  arrangement  leads  to  a  convenient 
format  of  the  microcopy,  similar  to  the  widely  used  format  for  catalog 
cards.  An  example  of  the  multi-row  microcards  printed  from  roll  nega¬ 
tives  are  the  so-called  summary  microcards  with  the  image  of  a  large 
number  of  microcopies  of  the  catalog  cards  or  other  card-catalog  mate¬ 
rials  (see  Fig.  7). 

With  a  reduction  ratio  of  14,  on  a  single  summary  card  of  75  *  125 
ran  format  [6]  there  are  located  144  standard  bibliographic  cards,  with 
a  reduction  of  25  there  are  350  such  cards,  ano  with  a  reduction  of  33 
there  are  700  cards.  The  summary  microcards,  retaining  the  usual  order 
of  sequential  arrangement  of  the  cards  in  the  card  file,  significantly 
reduce  its  volume  and  facilitate  operations.  Even  with  a  14-fold  reduc¬ 
tion,  one  million  bibliographic  cards,  transferred  to  the  nummary  mi¬ 
crocards,  fit  into  5  card-catalog  drawers  (1500  cards  in  one  drawer); 
with  the  usual  arrangement  this  same  volume  would  occupy  650  drawers. 
The  Introduction  of  the  summary  microcards  also  solves  the  problem  of 
coding  and  duplication  of  the  cards. 
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Using  the  scheme  described  for  storage  and  retrieval  -  negative  on 
8  reel,  positive  on  a  microcard  -  the  collection  of  any  documental  ma¬ 
terials  may  be  conveniently  organized  so  that  materials  on  the  same 
subject  will  be  located  on  an  individual  microcard.  The  practical  im¬ 
plementation  of  this  scheme  requires  a  copier  for  printing  micrccards 
from  roll  microfilms.  The  sample  summary  microcard  shown  in  Pig.  7  was 
produced  on  a  prototype  copier  developed  at  VINITI. 

Aperture  punchcards  are  a  combination  of  punch  cards  and  micro¬ 
cards  [7].  A  window  in  which  the  microcard  is  mounted  is  made  in  the 
punch  card.  Three  window  dimensions  are  provided  for  punch  cards  with 
edge  perforations  of  207  *  148  mm  format:  35  *  48,  35  *  105,  and  75  * 
x  105  mm,  which  correspond  to  4,  12,  and  40  pages  of  Journal  text  for  a 
15-20-fold  reduction.  Mounting  of  the  microcards  is  accomplished  by  ap¬ 
plying  narrow  strip  of  adhesive  tape  of  5-6  mm  width  along  the  perim¬ 
eter  of  the  window  so  that  a  portion  of  this  strip  is  on  the  surface  of 
the  punch  card  wnile  the  other  part  is  on  the  surface  of  the  microcard. 
The  microcard  slips  into  the  window  and  lies  in  the  same  plane  with  the 
punchcard.  The  adhesive  tape  is  produced  on  thin  (less  than  0.1  mm) 
transparent  film,  one  side  of  which  is  coated  with  the  adhesive  layer 
and  the  tape  is  wound  into  a  roll  with  the  adhesive  layer  inside.  The 
film  base  is  cellophane  (produced  by  the  Mosgorsovnarkhoz  firm)  or  pol¬ 
yethylene  (Okhtlnsk  Chemical  Combine,  Leningrad).  Adhesive  tape  on  a 
very  thin  (0.05  mm)  and  strong  lavsan  film  is  being  readied  for  produc¬ 
tion. 

The  aperture  punchcard  (Pig.  12a)  carries  a  bibliography,  search 
labels  which  are  entered  by  cutting  out  edge  openings,  microcopies  of 
the  documents,  and  an  abstract  (Pig.  I.H.).  i  edition  i  ith  a  card  file 
of  manual  aperture  punch  card:;  i*  accomplish*- .1  nr.  fi  ii- ws:  on  *hc  basis 
of  the  posed  questions  a  search  Is  made  for  U.  necessary • cards  using  a 
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Pig.  i?.  Aperture  punchcard.  a)  Pace  side;  b)  reverse  side.  "A  critique 
of  the  Urey  theory,  according  to  which  tektites  might  have  formed  as  a 
result  of  melting’ of  terrestrial  sedimentary  rocks  on  collision  with  a 
comet,  baaed  on  a  nonmeteoritic  theory  of  the  origin  of  tektites,  With 
reference  to  an  example  of  original  shape,  surface  structure  and  con¬ 
ditions  of  propagation  and  growth  of  australites,  it  is  suggested  that 
tektites  came  to  the  earth  from  elsewhere  and  acquired  secondary  forms 
at  supersonic  and  hypersonic  velocities  through  the  atmosphere*  Action 
of  atmospheric  agents,  over  the  5000  years  of  their  existence*  gave 
rise  to  the  appearance  of  certain  original  sculptural  forms." 

-  0.  Vorob'yev. 
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selector  needle  (the  cards  are  arranged  at  random  in  the  card  file), 
the  abstract  is  read  on  the  selected  cards,  and  if  necessary  the  card 
is  placed  in  a  conventional  reader  for  reading  or  copying  (see  above). 

Klyassernyy  punchcards.  Small  pockets  are  provided  on  one  side  of 
the  punchcard  in  which  microcopies  of  the  documents  are  placed  in  the 
form  of  microcards.  While  the  aperture  punchcards  do  not  provide  the 
possibility  of  introducing  additional  information  or  removing  obsolete 
materials,  the  klyassernyy  cards  have  a  clear  advantage  in  this  re¬ 
spect.  The  pockets  are  made  from  a  transparent  thin  film,  which  permits 
examining  the  heading  on  the  microcard  or  reading  the  entire  text  on  a 
reader  using  reflected  light  without  taking  the  card  from  the  pocket. 
The  klyassernyy  punchcards  may  be  useful  in  office  work  and  in  all  oth¬ 
er  cases  related  with  the  movement  of  documents  concerning  a  single 
common  question. 

Figure  13  shows  a  photograph  of  a  klyassernyy  punchcard  from  the 
side  of  the  pockets  for  inserting  the  microcopies.  The  pockets  of  the 
klyassernyy  punchcards  provide  reliable  protection  of  the  microcard 
surface  from  mechanical  damage. 

2.  Some  Systems  for  Highly  Mechanized  Automated  Search,  Based  on  the 
Use  of  Microphotocopies  of  Documents 

The  principle  of  highly  mechanized  search  is  that  using  a  coded 
form  of  the  search  labels  a  servo  device  selects  the  document  microcopy 
(if  it  is  located  alongside  the  search  label  code)  or  Indicates  its 
storage  location  (address)  in  order  to  accomplish  the  search  the  mate¬ 
rial  (photographic  film,  magnetic  tape,  magnetic  card)  on  which  the 
search  label  code  is  recorded  is  transported  relative  to  the  servo 
unit.  The  latter  is  a  block  of  photodiodes  or  a  block  of  magnetic  heads 
with  corresponding  electronic  logic  circuit  for  receiving  and  transmit- 
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ting  signals.  To  accomplish  the  search  a  request  is  entered  into  the 
searching  unit. 

As  indicated  above,  the  information  search  systems  are  based  on 
tape  (roll)  or  cards. 

Search  systems  using  microcopies  on  roll  photographic  film.  The 
Rapid  Selector  search  system  [8]  using  roll  photographic  film  began  op¬ 
eration  back  in  the  30' s  at  MIT  (USA).  The  document  microcopies  and  a 
photographic  code  record  of  their  search  labels  are  arranged  on  35-mm 
film.  The  film  passes  through  a  servo  unit  with  a  speed  of  100  m/min 
(3000  pages  of  documents).  The  photographic  code  of  the  search  labels 
(up  to  240  cells)  is  located  alongside  each  document.  When  the  required 
document  is  detected  its  photocopy  is  automatically  printed  without 
slowing  the  speed  of  the  photographic  film. 

More  advanced  models  of  the  Rapid  Selector  system  are  presently 
being  developed  with  higher  film  travel  speed  (by  a  factor  of  2-4). 

Tne  Flip  machine  of  the  Benson-Lehner  firm  (USA).  Roll  16-mm  per¬ 
forated  photo  film  is  used.  Alongside  the  microcopies  of  the  documents 
the  32-symbol  photographic  code  Is  recorded.  On  a  roll  of  length  376 
meters  there  are  72,000  frames,  the  roll  speed  is  1.5  m/sec.  The  se¬ 
lected  frame  is  projected  on  a  screen  with  a  magnification  of  the  order 
of  50. 

The  File  search  machine  of  the  FMA  firm  (USA).  On  roll  photograph¬ 
ic  film  of  length  330  meters  there  are  32,000  pages  and  scanning  is  ac¬ 
complished  with  a  speed  of  6400  page?/r.in  (photographic  film  speed  is 
1  m/sec).  Up  to  392  code  symbols  may  be  recorded  on  each  frame. 

The  Poisk  T  and  Pclck  OK  search  units  using  ?5-mm  imperforated 
photographic  film  are  being  developed  at  Y1NITI  of  the  Academy  of  Sci¬ 
ences  USSR. 

The  Filmsort  system  of  the  Minnesota  Mining  and  Manufacturing  firm 


(USA)  is  based  on  a  combination  of  machine  punchcards  of  187  *  82  mm 
format  and  microphotocopies  of  the  documents  [9],  The  latter  are  ar¬ 
ranged  on  a  microcard  of  size  35  *  48  mm  mounted  in  the  window  of  the 
punchcard  (Pig.  I1!). 

More  than  50  columns  on  the  80-column  punchcard  may  be  used  for 
recording  the  search  label  code.  Standard  punchcards  and  an  assortment 
of  counting-perforating  machines  are  used.  However,  in  order  to  avoid 
rapid  wear  of  the  aperture  cards  it  is  better  to  perform  the  search 
using  conventional  punchcards  in  order  to  then  select  manually  the  re¬ 
quired  aperture  punchcards.  The  Filmsort  system  has  found  application 
in  construction  drawing  documentation  services. 

A  manual  machine  with  an  average  productivity  of  400  cards  per 
hour  (Pig.  15)  and  an  automatic  machine  (Fig.  16)  with  an  output  of 
2000  cards  per  hour  are  used  for  mounting  the  microcards  in  the  punch- 
card  windows. 

Contact  copying  of  the  microphotocopies  onto  the  aperture  punch- 
cards  is  accomplished  on  diazo  film,  mounted  in  the  punchcard.  The  ap¬ 
erture  cards  with  unexposed  photographic  film  (diazo  film)  are  deliv¬ 
ered  in  finished  form;  to  obtain  copies  from  the  aperture  punchcards 
use  is  made  of  a  special  portable  copier  for  printing  from  card  to  card 
(Pig.  17). 

The  Mlnlcard  system  of  the  Eastman  Kodak  firm  (USA).  The  micropho¬ 
tocopy  of  the  document  or  group  of  documents  is  arranged  together  with 
the  photographic  code  on  the  microcards  [10]  (Fig.  18).  The  microcard 
size  is  16  *  32  mm.  The  microcards  contain  up  to  12  document  pages  with 
a  reduction  ratio  of  60  and  2730  symbols  of  the  search  label  code. 

There  is  available  a  negative  Mlnlcard  microcard  from  which  any  desired 
number  of  positive  copies  to  be  used  in  the  search  system  may  be  print¬ 
ed.  All  the  microcards  on  a  common  subject  are  threaded  on  a  common  rod 
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(there  is  an  opening  in  the  upper  part  of  the  microcards  for  this  pur 
pose)  and  are  stored  in  this  form. 
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1^4.  Aperture 

punchcard  of  the 

Filmsort  system. 

GRAPHIC  NOT 
REPRODUCIBLE 

Fig.  15-  Manual  machine  for 
mounting  Filmsort  aperture 
punchcards . 


Fig.  16.  Automatic  machine  for 
mounting  Filmsort  aperture 
punchcards . 


I  * 
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Such  an  arrangement  provides  a  very  flexible  possibility  for  add¬ 
ing  to  or  removing  part  of  the  material.  To  accomplish  the  search  the 
rod  with  .nicrocards  is  introduced  into  a  search  unit  with  a  handling 
capacity  of  1800  units/mln.  Copies  are  automatically  printed  from  the 
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selected  micrccards,  or  the  text  Is  projected  on  the  screen  of  a  read¬ 
er.  The  system  provides  extremely  compact  document  storage. 


GRAPHIC  NOT 
REPRODUCIBLE 


Pig.  17.  Unit  for  printing  from 
card  to  card. 


The  Minicard  system  was  first  introduced  into  operation  in  1953 
in  the  USA  Department  of  the  Air  Force  in  the  Pentagon.  Later  several 
governmental  departments  of  the  USA,  including  the  Central  Intelligence 
Agency,  converted  to  this  system.  The  entire  complex  consists  of  13  in¬ 
dividual  photographic,  optical,  electronic,  and  mechanical  units  of  a 
total  cost  from  2.5  to  3-5  million  dollars. 

The  Magnavue  system  of  the  Magnavox  firm  (USA).  Here  the  search 
label  code  is  recorded  magnetically  [11].  This  provides  an  advantage 
relative  to  the  photographic  method  of  recording  in  that  the  possibil¬ 
ity  is  provided  for  easily  introducing  corrections  and  additions. 


GRAPtOC  NOT 

reprowkible 


Pig.  18.  Microcard  of  the  Minicard 
system. 
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The  microphotocopies  of  the  documents  and  the  magnetic  matrix  are 
arranged  on  25  *  75  mm  size  microcards.  The  card  base  is  lavsan  which 
has  considerably  higher  strength  in  comparison  with  the  widely  used 
triacetate  base.  The  document  microcopies  occupy  one-third  of  the  card; 
*  the  remaining  two-thirds  are  covered  with  a  magnetic  layer  with  3000 

matrix  cells.  The  cards  pass  through  a  scanner  with  a  speed  of  5^00 
units/min.  Prom  the  magazine  where  they  are  stored  they  are  taken 
(sucked)  by  a  rotating  drum  to  the  surface  of  which  a  vacuum  is  ap¬ 
plied.  This  method  of  transporting  minimizes  the  card  wear  and  provides 
a  very  high  search  speed  (the  peripheral  speed  of  the  drum  rotation  is 
on  the  order  of  10  m/sec),  which  is  impossible  with  the  use  of  roll 
photographic  film. 

The  cards  selected  during  the  scanning  are  sorted,  the  microcopies 
are  read  on  the  screen  of  a  reader  or  magnified  copies  corresponding  to 
the  dimensions  of  the  original  documents  are  printed  from  them.  The 
text  image  may  also  be  transmitted  over  a  television  channel. 

The  Mlcroclte  system  at  the  National  Bureau  of  Standards  (USA)  is 
based  on  a  combination  of  transparent  (superpositioning)  punchcards  and 
microphotocopies  of  documents.  The  search  symbols  for  18,000  documents 
are  stored  on  manual  transparent  punchcards,  the  microphotocopies  of 
these  documents  are  arranged  on  large  sheets  of  photographic  film  of 
360  x  360  mm  format.  The  sheet  of  photographic  film  with  the  micropho- 

♦ 

tocopies  is  placed  on  a  drum  (Fig.  19)  which  is  interlocked  with  coor¬ 
dinate  bars  indicating  the  space  corresponding  to  the  number  of  the  se¬ 
lected  document  [12,  133* 

The  search  is  accomplished  by  manual  selection  of  the  superposi¬ 
tion  cards  corresponding  to  the  interrogation.  The  cards  are  aligned  on 
top  of  one  another,  and  the  coordinate  bars  are  adjusted  to  the  lighted 
segments  on  the  upper  card.  Depending  on  the  p<  sit  Jon  if  the  bars  (co- 
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Fig.  19.  Schematic  of  the  Microcite  system. 

ordinates  of  the  light  gap)  the  drum  and  the  photographic  film  mounted 
on  it  take  a  position  where  the  microcopy  of  the  desired  document  is  in 
the  framing  window  and  its  magnified  image  immediately  appears  on  the 
screen. 

The  Verac  system  of  the  AVCO  firm  (USA).  The  microphotocopies  of 
the  documents  are  arranged  on  cards  made  from  photographic  film  of 
200  x  200  mm  format.  With  a  reduction  factor  of  140  there  are  10,000 
microcopies  on  a  single  card;  100  cards  contain  1  million  documents  [1, 
8].  To  obtain  microphotocopies  with  such  large  reduction  factors  use  is 
made  of  the  photochromic  process.  There  are  no  recordings  of  the  search 
labels  on  the  cards.  The  search  for  the  required  documents  is  accom¬ 
plished  with,  the  aid  of  the  manual  finder  (for  example,  using  superpo¬ 
sition  cards)  or  a  computer.  The  search  amounts  to  determining  the  co¬ 
ordinates  of  the  microcopy  of  the  document  on  the  card,  after  which  the 
required  document  is  found  within  2  seconds.  The  system  permits  obtain¬ 
ing  microcopies  of  the  documents  on  photographic  film  segments  of 
width  35  mm  or  magnified  copies,  right  up  to  the  full-size  dimensions 
of  the  document.  The  microcopy  may  also  be  examined  using  a  television 


channel. 


With  regard  to  compactness  of  document  arrangement  the  Verac  sys¬ 
tem  surpasses  all  the  previously  mentioned  systems.  In  addition,  having 
very  high  capacity,  it  can  provide  rapid  search  of  a  large  array  of 
documents  with  respect  to  a  large  number  of  labels  without  recourse  to 
complex  electronic  machines.  With  document  reduction  factors  greater 
than  100,  when  thousands  of  documents  are  arranged  on  the  small  area  of 
the  photographic  film,  there  is  no  need  for  complex  devices  for  trans¬ 
porting  during  search,  which  simplifies  considerably  the  search  device 
itself.  Thus,  the  high  reduction  ratio  of  the  documents  using  micropho¬ 
tocopying  has  solved  smother  important  problem  in  addition  to  the  prob¬ 
lem  of  compact  document  storage  -  the  problem  of  creating  simple  high¬ 
speed  systems. 

The  Walnut  system  of  the  IBM  firm  (USA)  solves  the  problem  of 
transporting  photographic  film  with  microphotocopies  arranged  on  the 
film  so  that  the  wear  of  the  photographic  film  is  reduced  to  a  minimum. 
Protection  of  the  photographic  surface  from  mechanical  damage  during 
its  passage  through  the  scanner  in  the  searcher  is  a  very  complex  prob¬ 
lem  which  is  solved  in  this  system  as  follows:  a)  the  search  label  code 
Is  separated  from  the  document  microcopies  and  is  arranged  separately 
on  magnetic  tape;  b)  the  microcards  with  the  microphotocopies  of  the 
documents  are  not  transported  -  they  are  stored  permanently  in  special 
magazines  and  are  taken  out  (at  low  speed)  for  copying  the  Information 

Cl*]. 

The  microphotocopies  are  reduced  in  two  steps.  First  microfilm  on 
35-mm  roll  photographic  film  is  prepared,  which  is  then  again  reduced 
during  printing  of  the  image  on  the  microcards;  the  over-all  reduction 
ratio  of  the  document  is  35-  On  each  microcard  there  are  arranged  3 
rows  with  .  ilcrocopies  per  row  -  99  pages  In  all.  Tie  microcards  are 
arranged  vertically  in  special  plastic  magazines  holding  r>0  units.  The 
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magazine  dimensions  are  26.7  *  39  mm,  height  397  mm  (containing  ^950 
microcopies).  The  magazines  are  mounted  on  a  drum  in  groups  of  200, 
which  thus  provides  for  a  minimal  information  capacity  of  the  system  of 
990,000  document  microcopies.  At  most  the  system  can  contain  more  than 
100  such  card-file  drums.  The  microcards  never  leave  the  magazine, 
therefore  their  wear  is  eliminated.  The  search  label  code  is  recorded 
separately  on  magnetic  tape.  The  search  for  the  required  documents  is 
accomplished  in  two  stages:  first  on  the  magnetic  tape  the  location  of 
the  document  is  established,  after  which  the  drum  begins  to  rotate  and 
the  magazine  where  the  microcard  with  the  given  document  is  located  is 
fed  to  a  special  set  of  fingers  for  its  removal.  After  the  microcard  is 
removed  from  the  magazine,  the  required  microcopies  are  illuminated  by 
ultraviolet  rays  and  the  image  is  printed  on  an  aperture  card  carrying 
unexposed  photographic  film  of  the  Kalvar  type.  Immediately  after  expo¬ 
sure  the  image  is  developed  by  heating,  and  the  information  is  deliv¬ 
ered  on  the  aperture  punchcard.  The  microcopy  may  be  read  with  the  aid 
of  a  reader  or  an  optical  magnified  copy  may  be  printed  from  it.  Only  5 
seconds  is  required  to  obtain  such  a  copy  from  any  interrogated  docu¬ 
ment.  On  the  average  600  aperture  cards  with  four  microcopies  on  each 
of  them  may  be  output  in  1  hour. 

IV.  READING  AND  EFFICIENT  CODING  OF  MICROPHOTOCOPIES 

Reading  of  the  microphotocopies  is  accomplished  on  special  readers 
intended  for  magnifying  them  to  full  size.  Stationary,  table-top,  port¬ 
able,  and  even  pocket  readers  are  available.  With  respect  to  nature  of 
the  illumination  we  differentiate  two  types  of  equipment:  for  reading 
in  transmitted  light  (microcopies  on  a  transparent  base)  and  in  re¬ 
flected  light  (microcopies  on  an  opaque  base).  There  are  available  also 
combined  readers  for  reading  in  transmitted  and  reflected  light. 
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The  type  5P01  Mlorofot  reader  (Fig.  20).  This  table-top  type 
equipment  is  produced  in  serial  quantities  by  the  Moscow  Cine  and  Elec* 
tromechanical  Plant  -  KEMZ.  It  is  intended  for  reading  in  transmitted 
light  of  roll  microfilms,  microstrips  (segments  of  35-mm  photographic 
film).  A  special  attachment  is  fabricated  at  VINIT1  for  reading  micro¬ 
cards  and  aperture  cards  (Pig.  21).  The  screen  size  is  300  x  360  mm . 

The  device  is  equipped  with  two  Jupiter-8  and  Jupiter-12  lenses  for 
reading  with  magnifications  of  10  and  16.  The  microcards  and  micro- 
strips  are  fed  into  the  equipment  for  reading  much  more;  sinvJv  "than  the 
microcopies  on  roll  photographic  film. 

The  AChS-1  reader  (Fig.  22).  An  experimental  model  has  beer,  devel¬ 
oped  and  fabricated  by  the  Moscow  Design  Bureau  for  Cine  Equipment  and 
serial  production  is  planned.  The  equipment  is  stationary,  intended  for 
reading  microphotocopies  on  roll  photographic  film  of  35  and  70-mm 
width  or  microcards.  The  following  magnifications  are  provided:  for  70- 
mm  photographic  film  -  7.7  and  13;  for  35-mm  photographic  film  -  18  and 
30.  The  screen  size  is  850  x  650  mm. 

The  AChM-2  reader.  An  experimental  model  has  been  developed  and 
fabricated  by  the  Moscow  Design  Bureau  for  Cine  Equipment,  preparations 
are  being  made  for  serial  production  .  The  portable  model  (Fig.  23)  is 
intended  for  reading  microphotoccpies  on  roll  photographic  film  of  16 
and  35-nun  width  or  microcards.  The  screen  size  is  550  x  550  mm,  the  im¬ 
age  Is  projected  on  the  table.  Magnifications  of  15  and  ?6  are  provid¬ 
ed.  This  equipment  may  be  used  not  only  in  institutions,  but  als?  under 
field  and  home  conditions.  It  may  also  be  used  as  a  projector  or  photo¬ 
graphic  enlarger. 

The  AChK-3  pocket  reader.  An  experimental  model  has  been  developed 


and  fabricated  by  the  Moscow  Design  Bureau  for  Cine  Equipment.  In  the 
folded  form  it  is  the  site  of  a  t  )k.  Reading  is  accomplished  through 
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an  eyepiece,  which  is  a  drawback  of  the  equipment.  The  enlargement  is 
from  6  to  40  times  (Pig.  24).  This  device  is  equipped  for  reading  mi¬ 
crophotocopies  on  roll  film  of  16  and  35-mm  width  or  microcards.  Inser¬ 
tion  of  the  microcards  for  reading  is  convenient  and  is  accomplished 
very  quickly.  Reading  may  be  accomplished  using  daylight  or  with  illu¬ 
mination  from  a  desk  lamp.  Built-in  illumination  is  provided  using  a 
miniature  bulb  supplied  from  a  pocket  flashlight  battery.  This  device 
may  be  used  to  read  microphotocopies  in  both  transmitted  and  reflected 
light . 


GRAPHIC  NOT 
REPRODUCIBLE 

Pig.  20.  Micro fot  reader. 


GRAPHIC  NOT 
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Pig.  21.  Attachment  to  Microfot 
reader  for  reading  aperture 
punchcards . 


The  Mark  VII  reader  of  the  Microcard  Corporation  (USA)  is  intended 
for  reading  mlcroph^tocopies  on  an  opaque  base  in  reflected  light  (Fig. 
25).  The  magnification  factor  is  24.  The  screen  sire  is  250  *  300  mm. 
The  device  has  a  convenient  arrangement  for  inserting  and  shifting  the 
microcard  during  reading,  is  equipped  for  operation  in  a  lighted  room; 
a  fan  is  provided  to  remove  the  head  from  the  illuminating  lamp. 
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Fig.  2?.  AChS-1  reader. 


GRAPHIC  WIT 
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Fig.  23.  AChM-2  reader. 


GRAPHIC  NOT 


Fig.  2^.  AChK-3  reader. 


GRAPHIC  NOT 
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Fig.  25.  Mark  VII  reader 
for  reading  microcards 
on  an  opaque  base  (in 
reflected  light). 
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Pig.  26.  Filmac-lOO  read-  Pig.  27.  PES  read?r-cop- 

er-copier,  ier. 

For  efficient  production  of  optical  (enlarged)  copies  from  the 
microphotocopies  being  read  use  is  made  of  reader-copiers  which  are  a 
combination  of  a  reader  with  a  device  for  rapid  copying.  The  rapid  pro¬ 
duction  of  optical  copies  is  based  on  the  corresponding  photographic 
processes  using  specialized  photographic  materials. 

The  Fllmac-100  reader-copier  of  the  firm  Minnesota  Mining  and  Man¬ 
ufacturing  (USA)  has  been  produced  since  1958.  The  enlargement  ratios 
are  7,  9,  15*  19*  and  26.  The  screen  size  Js  24o  *  265  mm  (Fig.  26). 

The  time  for  obt'  ning  a  single  optical  copy  is  3  sec  [15]. 

The  model  PES  reader-copier  of  the  Rekordak  firm  (USA)  has  been 
produced  since  1962  (Fig.  27).  The  screen  size  is  200  *  300  mm.  The  en¬ 
largement  ratio  for  reading  is  23,  for  printing  21.  The  time  for  pro¬ 
ducing  a  single  copy  is  1  minute. 

The  Dlophot  M-35  reader-copier  has  been  produced  by  the  Barton 
Mount  (USA)  firm  since  1962.  The  enlargement  ratio  is  up  to  15.  The 
screen  size  is  575  *  **30  mm.  The  time  for  producing  a  single  copy  is  no 
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more  than  25  sec.  The  electrostatic  method  of  producing  copies  is  used. 


2. 

3. 

4. 

5. 

6. 

7. 

8. 


REFERENCES 

A. S.  Tauber,  W.S.  Myers.  Photochromic  Micro-amges-a  Key 

to  Practical  Microdocument  Storage  and  Dissemination,  Paper 
presented  at  "National  Microfilm  Assoc, ,"Conf.  Washington, 
D.S.,  1962  April,  25-27. 

J.  Kosar.  Thermal  development  of  diazotype  papers.  -  Photo¬ 
graphic  Science  and  Engineering,  1961,  5,  No.  4,  pages  239- 
243. 

B.  Friedland.  Technological  Advances  in  the  Thermal  Diazo 
Process  -  Photographic  Science,  1962,  10,  pages  174-177. 

W.A.  Seifert,  W.F.  Elbrecht.  The  Kalvar  Process  -  Photographic 
Science  and  Engineering,  1961,  5,  No.  4,  pages  235-238. 

G.H.  Davison,  Microcards  and  Microfiches.  -  The  Library 
Association  Record,  1961,  63,  No.3»  pages  67-78. 

S.A.  Lipkin,  Q.G.  Vorob’yev,  L.P.  Shchegolev.  Modemizatsiya 
traditsionnykh  kartotek  s  primeneniyem  mikrofotokopirovaniya 
[Modernization  of  Traditional  Card  Files  with  the  Use  of 
Microphotocopying].  Nauchno-tekhnicheskaya  infonaatsiya, 

1963,  No.  7,  pages  9-11. 

S.A.  Lipkin,  G.G.  Vorob'yev.  Izgotovleniye  i  chteniye  aper- 
turnykh  perfokart.  [Preparation  and  Reading  Punch  Cards], 
Nauchno-tekhnicheskaya  informatslya,  1963,  No.  3,  pages  40- 
46. 

Special  report.  Companies  are  devising  new  techniques  for 
combating  what  is  coming  to  be  known  apprehensively  as  the 
nation's  "technical  information  crisis."  -  Chemical  and  En¬ 
gineering  News,  1961,  July  17,  pages  103-114. 


-  85  - 


9. 


10. 


11. 


12. 


13. 


14. 


15. 


Automatic  selection  and  reproduction  of  Industrial  documents 
Data  Processing,  1959 »  Ocotober-December . 

A.  minicard  system  for  documentary  information.  -  American 
Documentation,  1959,  October. 

R.L.  Laurent.  Magnacard.  -  Data  Processing,  1962,  4,  No.  10, 
pages  33-38. 

B. E.  Bushor.  Information  storage  retrieval.  -  Electronics, 
1962,  June  29,  pages  40-61. 

A  new  microcite  machine.  -  Technical  News  Bull.,  1961,  45, 

No.  7,  pages  113-115. 

P.D.  Bradshaw.  The  Walnut  system;  A  large  capacity  document 
storage  and  retrieval  system.  -  American  Documentation,  1962, 
Nc.  3,  270-275. 

National  microfilm  association.  Guide  to  microreproduction 
equipment.  Edited  by  Hubbard  W.  Ballou,  second  edition,  1962. 


Manu¬ 

script 

Page 

No. 

33 

33 

35 


37 

39 

41 


[Transliterated  Symbols] 

u  •  m  «  mikrofotokopiya  »  microphotocopy 
n  »  d  »  dokument  »  document 

HMK4M  ■  NIKFI  ■  Nauchno-issledovatel' skiy  kinofotoinstitut  ■ 
-  Scientific  Research  Institute  of  Cinemato¬ 
graphy 

rOCT  ■  GOST  ■  Oosudarstvennyy  obshchesoyuznyy  standart  ■ 
State  Standard 

KMHATI  ■  KINAP  •  kinoapparatura  •  cine  equipment 
MK5K  ■  MKBK  ■  Moskovakoye  konstruktorskoye  byuro  klno- 

apparatury  ■  Moscow  Design  Bureau  for  Cine 
Equipment 
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^eHKHHan  *  lenklnap  ■  Leningradskiy  zavod  kinoapparat'^y  * 

»  Leningrad  Cine  Equipment  Plant 
BHHHTM  ■  VINITI  ■  Vsesoyuznyy  institut  nauehnoy  i  tekhniches- 

koy  informatsii  ■  All-Union  Institute  of 

Scientific  and  Engineering 
Information 

MocropcoBHapxo3  *  Mosgorsovnarkhoz  *  Sovet  narodnogo  khoz- 

yaystva  Moskovsk^go 
gorodskogo  ekonomiches- 
kogo  rayona  «  Economic 
Council  of  the  Moscow 
City  Economic  Region 

KEM3  *  KEMZ  ■  kinoelektromekhanicheskiy  zavod  ■  cine  and 
electromechanical  plant 
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USE  OP  EDGE-PUNCHED  CARDS  FOR  ABSTRACT  INFORMATION 
ON  THE  BASIS  OF  THE  RZh  METALLURGIYA 

T.P.  Kolesnikova  and  A.V.  Shmeleva 
The  publication  of  the  abstract  journal  "Metallurgiya"  beginning 
in  1953  has  made  it  possible  to  concentrate  on  its  pages  the  metallur¬ 
gical  literature  which  is  randomly  scattered  in  the  scientific  and 
technical  Journals,  collections,  and  works  in  which  there  is  no  precise 
specialization  and  the  articles  are  published  in  any  of  these  publica¬ 
tions  at  the  whim  of  the  author  and  the  editors'.  Thus,  in  the  RZh  Met¬ 
allurgiya  during  the  five-ysar  period  beginning  with  1956  publications 
have  been  covered  from  39^.1  titles  of  periodical  and  serial  editions 
alone.  We  can  consider  that  the  RZh  Metallurgiya  provides  quite  com¬ 
plete  information  on  the  foreign  and  Soviet  material  published  In  ac¬ 
ceptable  time  periods.  It  has  become  considerably  easier  to  follow^ the 
entire  current  world  literature  on  any  question  of  metallurgy.  Figure  1 
presents  the  basic  indices  which  characterize  the  quantity,  volume,  and 
time  that  the  materials  are  in  the  metallurgy  section  of  VINITI. 

This  consolidation  was  absolutely  necessary  for  the  organization 
of  a  unified  flow  of  information  in  a  given  branch  of  science  or  engi¬ 
neering.  but  the  summary  volume  of  the  RZh  -  a  tremendously  bulky  and 
expensive  publication  -  could  be  acquired  only  by  large  organizations. 
In  very  few  libraries  could  the  reader  find  all  the  sections  of  the  RZh 
Metallurgiya.  The  following  step  toward  facilitating  the  search  for 
current  information  on  metallurgy  was  the  division  in  1958  of  the  sum¬ 
mary  volume  of  the  RZh  Metallurgiya  into  several  Issues  on  individual 
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fields,  clearly  demarcated  with  respect  to  subject  matter.  In  1958 
there  were  only  two  issues  on  the  production  of  iron  and  steel  and  on 
welding.  In  1964  the  number  of  such  issues  increased  to  ten:  theory  of 
metallurgical  processes;  metallurgical  heat  engineering;  control  and 
measurement  instruments  and  automation  of  metallurgical  production; 
production  of  iron  and  steel;  metallurgy  of  nonferrous  and  rare  metals; 
rolling  and  wire  drawing  production;  welding;  metal  science  and  heat 
treatment;  technical  analysis  in  metallurgy;  technology  and  equipment 
for  casting  production;  technology  and  equipment  for  forging-stamping 
production.  At  the  present  time  the  last  two  issues  do  not  appear  in 
the  combined  volume  of  the  RZh  Metallurgiya,  but  in  the  combined  volume 
of  Tekhnologiya  mashinostroyeniya  [Machinery  Technology].  Figures  2  and 
3  show  the  growth  by  years  of  the  number  of  subscribers  to  the  combined 
volume  and  the  abstract  sections  on  metallurgy.  We  see  from  the  figures 
that  the  efficiency  of  the  utilization  of  the  abstract  information  has 
increased  Sharply. 


Fig.  1.  Indices  of  the  RZh  Metallurgiya.  1)  N'>^ver  of  abstracts  in  com¬ 
bined  volume  per  year;  2)  size  of  combined  '  e  per  year.  In  avtor- 
skiy  list;  3)  density,  in  abstracts  per  1  av.or  .kiy  list;  4)  average 
time  for  processing  materials  in  the  metallurgy  section,  in  months,  a) 
Indices;  b)  years. 
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Pig.  2.  Circulation  of  the  RZh  Metallurgiya  and  its  issues  by  year.  1) 
Total  circulation  of  combined  volume  and  its  sections;  2)  number  of 
sections  in  the  combined  volume  of  RZh  Metallurgiya;  3)  circulation  of 
combined  volume,  a)  Indices;  b)  years. 

But  the  problem  of  the  tremendous  excess  of  information  in  the 
combined  volume,  somewhat  reduced  in  the  sections,  for  each  specialist 
has  continued  to  remain  unresolved. 

If  we  follow  the  path  of  further  fractionation  of  the  sections  of 
the  RZh,  then  this  will  inevitably  lead  to  an  increase  of  information 
loss,  since  with  the  present  technique  for  preparing  the  RZh  at  VINITI 
the  duplication  of  abstracts  in  the  various  sections  of  a  single  com¬ 
bined  volume  is  eliminated. 

The  intimate  interrelationship  of  the  branches  of  science  and  en¬ 
gineering  makes  it  very  difficult  to  centralize  all  information  in  is¬ 
suing  the  sections  of  the  RZh,  since  in  practice  it  is  impossible  to 
duplicate  abstracts  in  all  those  sections  in  which  this  would  be  re¬ 
quired  for  a  given  specialist. 

But  the  primary  deficiency  of  RZh  is  in  retrospective  search.  This 
search  with  the  use  of  the  index  is  quite  complicated,  slow,  and  does 
not  permit  the  use  of  any  mechanization.  Scanning  of  the  subject  indi¬ 
ces  for  several  years,  usually  Issued  with  some  delay,  writing  out  the 
numbers  of  the  abstracts  with  subject  matter  close  to  that  desired, 
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finding  the  necessary  copies  of  the  RZh,  and  the  abstracts  in  each  RZh, 
and  the  subsequent  digging  out  of  the  original  sources  -  this  is  the 
far  from  simple  way  in  which  the  information  search  is  conducted  at 
present. 
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Pig.  3»  Variation  of  circulation  with  number  of  sections  published  in 
parallel  with  the  combined  volume  of  the  RZh  Metallurgiya.  1)  Circula¬ 
tion;  2)  number  of  sections. 

The  referral  nature  of  the  subject  index  along  with  the  abstract 
nature  of  the  headings  leads  to  considerable  information  noise  during 
the  search.  The  present  breaking  down  of  the  headings  of  the  subject 
indices  makes  them  ever  more  cumbersome.  A  survey  of  the  scientific 
workers  in  the  field  of  metallurgy  has  established  that  the  search  for 
required  material  using  the  annual  subject  indices  to  the  RZh,  as  a 
rule,  does  not  reduce  the  time  in  comparison  with  search  through  the 
RZh  without  an  index. 

All  these  circumstances  indicate  the  serious  necessity  for  the  de¬ 
velopment  of  different  information  search  systems  which  will  make  it 
possible  to  accelerate  and  facilitate  the  retrospective  search,  provide 
a  multi-aspect  approach  in  each  given  search  and  in  time,  and,  finally, 
the  creation  of  individual  and  specialized  information  reference  sourc¬ 
es. 

We  considered  that  it  was  necessary  to  make  more  efficient  use  of 
the  basic  RZh  Metallurgiya  with  an  improved  process  for  systematizing 
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and  processing  the  primary  information.  Edge-punched  cards  answered  our 
requirements  most  completely. 

In  this  connection  at  the  VINITI  in  the  metallurgy  section  with 
the  aid  of  the  Scientific  Council  on  Cybernetics  of  the  Academy  of 
Sciences  USSR  there  was  developed  in  1963  and  published  in  1961*  9  (10 
in  1965)  punchcard  catalogs  on  metallurgy  using  the  standard  K5  punch- 
card  format  with  two-row  edge  perforations.* 

The  type  of  card  which  we  selected  permits  arranging  on  it  a  com¬ 
plete  abstract-  from  the  RZh  Metallur-giya,  which  to  a  considerable  de¬ 
gree  eliminates  the  referral  nature  of  the  card  file.  The  number  of 
perforations  intended  for  coding  the  subject  and  bibliographic  labels 
is  200,  quite  adequate  for  any  specialist. 

In  developing  the  card  prototype  we  attempted  to  create  a  standard 
prototype  which  would  be  clear,  simple  in  operation,  and  basically 
could  be  used  for  abstract  information  in  any  branch  of  science  and  en¬ 
gineering.  The  standardized  model  makes  it  considerably  easier  to  add 
to  the  information  reference  store  abstracts  from  any  related  fields. 

The  left  side  of  the  internal  field  of  the  card  and  the  corre¬ 
sponding  right  side  on  the  reverse  were  set  aside  for  the  abstract.  The 
right  side  remains  free.  Cn  special  order  a  window  may  be  cut  here  and 
a  microphotccopy  of  the  given  publication  mounted  (up  to  ^0  pages). 
Thus,  by  locating  the  text  on  the  free  field  of  the  punchcard,  we  ob¬ 
tain  a  very  convenient  microlibrary  with  the  possibility  of  information 
retrieval.  There  is  no  necessity  for  storing  either  specialized  or  ab¬ 
stract  Journals,  By  elementary  means  the  reader  extracts  only  those 
cards  which  are  obviously  of  Interest  to  him. 

It  was  decided  to  assign  the  great  majority  of  the  locations 
(about  150  and  any  combination  of  them)  for  coding  the  subject  labels 
which  reveal  the  contents  of  each  publication.^ 
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For  clarity  on  each  card  there  is  printed  a  stencil  with  notation 
of  the  subject  and  bibliographic  labels  using  coded  symbols  grouped  in 
individual  fields.  The  stencil  shown  in  Fig.  is  common  to  all  the 
card  files  of  the  RZh  Metallurgiya  and  in  the  future  will  be  extended 
to  related  fields.*  The  code  symbols  in  the  left,  right,  and  lower  mar 
gins  on  each  card  file  have  their  own  particular  meanings. 


Fig.  *i.  Punchcara  model.  1)  Author;  2)  publication 
type;  3)  field;  4)  country;  5)  issue;  6)  year;  7) 
materials . 


The  upper  margin  of  the  card  is  Intended  for  the  following  labels 
(from  left  to  right):  surname  of  author  (first  letter),  publication 
type,  branch  of  industry  and  country  on  which  information  is  presented 
in  the  publication,  number  of  the  card  file,  year  of  publication. 

The  subject  labels  of  each  card  file,  grouped  using  the  local  dec¬ 
imal  system,  are  coded  on  the  left  and  lower  margins  of  the  card.  Each 
of  the  ten  groups  is  assigned  a  field  with  10  openings,  numbered  using 
Arabic  numerals  frc-r.  0  to  9-  The  zero  field  and  the  sere  punch  of  each 
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group  are  intended  for  coding  general  questions  of  a  given  card  file  or 
a  given  group  of  labels.  The  last  field  and  the  opening  in  each  field 
designated  by  the  digit  9  are  reserve,  intended  for  coding  questions 
which  with  respect  to  subject  matter  relate  to  the  given  card  file  or 
given  group  of  labels,  but  which  were  not  reflected  in  the  originally 
developed  system.  Each  search  subject  label  has  its  own  index.  For  ex¬ 
ample,  the  index  II  2.1  denotes:  IV  is  card  file  IV  (Nonferrous  metal¬ 
lurgy),  2  is  the  label  (field)  2  (Smelting),,  1  i3  the  sublabel  (Tower 
smelting) . 

The  local  decimal  system  is  very  convenient,  since  it  is  simpie, 
clear,  easily  understood  by  the  people  who  code  the  information  docu¬ 
ment  during  its  publication  and  by  those  who  use  it.  It  is  not  diffi¬ 
cult  to  memorize  all  ten  basic  headings  of  the  card  file,  clearly  dif¬ 
ferentiated  with  respect  to  subject.  In  this  case  it  is  easy  to  detect 
the  need  for  shifting  the  information  document  to  a  different  card 
file.  This  system  makes  it  possible  to  express  clearly  in  the  search 
language  the  multi-aspect  nature  of  the  metallurgical  literature.  Ex¬ 
cessive  expansion  and  extension  of  the  hierarchy  of  the  search  language 
would  divide  the  attention  of  the  one  doin?  the  coding  and  would  create 
additional  difficulties  in  searching.  Both  these  factors,  as  is  known, 
lead  to  loss  of  information. 

The  right  margin  of  the  card  is  intended  for  coding  the  chemical 
elements  of  the  periodic  system,  metallurgical  production,  and  other 
materials  (^0  positions). 

Both  combined  and  direct  coding  methods  are  used  in  the  card  cata¬ 
logs  on  metallurgy.  The  combined  code  is  used  to  encode  the  biblio¬ 
graphic  mutually  exclusive  labels,  and  the  direct  code  la  used  for  the 
parallel  subject  labels  and  materials.  In  the  first  case,  to  each  bib¬ 
liographic  label  corresponds  a  combination  of  two  or  more  definite 
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notches.  In  the  first  case,  to  each  label  corresponds  a  single  definite 
type  of  notch:  a  shallow  notch,  corresponding  to  an  opening  of  the  out¬ 
er  row;  a  combining  notch  (slit)  is  a  label  corresponding  to  an  opening 
of  the  inner  row,  a  deep  notch  indicates  both  labels  together.  In 
searching  for  a  label  denoted  in  the  outer  row  of  openings,  the  needle 
is  inserted  in  the  outer  opening  and  the  cards  with  shallow  and  deep 
notches  full  out,  i.e, ,  all  the  cards  containing  the  desired  informa¬ 
tion.  When  searching  for  a  label  corresponding  to  the  inner  row  of 
openings,  the  needle  is  inserted  in  the  inner  opening  and  the  cards 
with  a  deep  cutout  fall  out;  the  stack  of  cards  remaining  on  the  needle 
are  grasped  with  the  left  hand,  the  needle  is  removed  and  inserted  in 
one  of  the  corner  openings  —  the  cards  with  a  slit  then  fall  out. 

Along  with  the  possibility  of  coding  the  contents  of  each  publica¬ 
tion  with  respect  to  all  the  headings  of  a  given  card  catalog  the  prin¬ 
ciple  of  direct  coding  of  subject  labels  with  the  aid  of  the  shallow 
and  slitted  notches  provides  completeness  and  the  absence  of  informa¬ 
tion  noise  when  extracting  cards  from  the  array. 

We  did  not  set  ourselves  the  task  of  creating  a  search  language 
which  would  completely  provide  for  information  search  with  respect  to 
very  narrow  subject  questions.  The  functions  of  our  headings  are  more 
distributive.  The  decimal  system  which  we  suggest  facilitates  the  crea¬ 
tion  of  a  search  language  necessary  for  the  highly  specialized  organi¬ 
zations  or  individual  subscribers. 

The  process  of  preparing  the  punchcards  for  publication  begins  in 
the  bibliographic  group  o*  the  metallurgy  section,  whose  workers  attach 
to  each  Incoming  publication  In  the  section  a  working  card  made  from 
plain  paper,  without  openings,  but  precisely  duplicating  the  punchcard 
format  whicn  has  been  adopted,  and  make  on  this  card  indications  of  the 
bibliographic  labels  in  ink:  author,  year  of  publication,  type  of  pub- 
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lication  with  respect  to  form.  A  working  number  is  immediately  written 
on  the  working  card.  At  the  stage  of  scientific  or  control  editing  of 
the  abstract  mark  on  the  working  card  all  the  labels  which  reflect  the 
contents  of  the  given  publication. 

Then  the  abstracts  are  gathered  under  the  headings  of  the  RZh  Met- 
allurgiya  into  an  issue,  are  sent  to  the  production  printing  combine  of 
VINITI  and  are  printed.  A  single  typesetting  is  used  for  printing  the 
abstracts  in  the  journal  and  on  the  punchcards.  At  the  locations  on  the 
punchcards  corresponding  to  the  code  marks  in  ink  on  the  working  card 
typographical  marks  are  made  -  .shadings .  The  notches  are  made  by  the 
user  himself  in  accordance  with  the  suggested  or  his  own  local  decimal 
classification  scheme. 

The  vacant  combinations  in  the  codes,  the  unused  openings  in  the 
fields,  and  the  free  fields  are  used  for  supplementing  the  card  catalog 
system  in  the  process  of  usage.  The  three  corner  perforations  them¬ 
selves  are  not  used,  since  they  perform  an  auxiliary  role  in  searching 
and  permit  the  extraction  from  the  card  file  of  erroneously  inverted 
cards.  When  the  classification  is  extended,  the  subject  headings  are 
given  new  values  by  partial  or  complete  changeover,  and  for  combined 
storage  a  signaling  notch  is  made  in  one  of  the  unused  corner  perfora¬ 
tions  of  the  new  cards.  In  the  case  of  a  slight  alteration  of  the  model 
all  the  cards  with  obsolete  notches  are  removed  from  the  card  file,  the 
notches  are  patched  cr  new  ones  are  made. 

It  should  be  noted  that  since  the  first  year  of  publication  of 
cards  with  edge  perforations  we  have  increased  the  number  of  subscrip¬ 
tion  issues  based  on  the  RZh  Metallurgiya  from  9  to  63.  The  average 
number  of  subscribers  to  each  set  had  doubled  by  1965.  Thus,  the  total 
circulation  of  the  RZh  Metallurgiya  with  its  sections  and  the  sets  of 
punchcards  is  about  18,500  copies  in  1965* 
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The  further  fractionation  of  the  subscription  sections  will  be  de¬ 
termined  by  the  profitability  and  the  organizational  capabilities  for 
their  distribution. 

An  analysis  of  the  operation  of  the  reference  information  sources 
based  on  the  K5  abstract  cards  will  be  made  after  some  time  has  passed. 

[Footnotes] 

This  work  was  performed  under  the  guidance  of  Candidate  of 
Technical  Sciences  T.P.  Kolesnikova. 

Since  1966  this  model  in  a  somewhat  modified  form  has  been 
accepted  as  mandatory  for  all  abstract  information  on  metal¬ 
lurgy  . 

[Transliterated  Symbols] 

P3K  *  RZh  *  referativnyy  zhurnal  *  abstract  Journal 
BHHHTH  *  V1NITI  ■  Vsesoyuznyy  institut  nauchnoy  i 

tekhnicheskoy  informatsii  *  All-Union  Insti¬ 
tute  of  Scientific  and  Engineering  Informa¬ 
tion 
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WAYS  OP  USING  INFORMATION  SEARCH  SYSTEMS  IN  MEDICINE 
N.M.  Amosov  and  O.P.  Mintser 

Medical  science  is  undergoing  great  difficulties  in  connection 
with  a  whole  series  of  aspects.  The  modern  ideas  on  the  physiological 
and  pathological  processes  within  the  organism  are  unusually  varied  and 
include  the  vast  knowledge  accumulated  over  the  centuries.  Thousands  of 
journals,  collections,  books  are  published  daily  devoted  to  the  results 
of  the  work  of  thousands  and  thousands  of  doctors  and  researchers. 

The  selection  of  Journal  articles  for  recent  years  on  a  particular 
subject  can  be  made  only  by  continuous  scanning  of  all  the  journals  of 
this  specialty.  However,  we  can  never  be  certain  that  we  have  not 
missed  something  important,  and  the  existing  bibliographies,  in  addi¬ 
tion  to  the  fact  that  they  are  far  from  complete,  are  published  with  a 
considerable  delay.  Hence  the  research  results  become  accessible  for  a 
wide  circle  of  doctors  only  when  their  timeliness  has  to  a  considerable 
degree  been  lost  in  connection  with  the  appearance  of  new  methods  of 
study  and  treatment.  This  is  why  the  treatment  institutions  prefer  to 
duplicate  studies  rather  than  find  their  results  in  the  literature. 

On  the  other  hand,  considerable  difficulties  in  finding  and  proc¬ 
essing  information  in  clinics  are  associated  with  the  numerous  methods 
for  treating  patients,  which  in  the  majority  of  cases  are  not  reflected 
in  the  appearance  of  new  forms  of  documentation.  The  records  in  histo¬ 
ries  of  disease  frequently  differ  in  their  extremely  disconnected  na¬ 
ture,  the  absence  of  systematization,  and  frequent  errors.  All  this 
makes  the  search  for  the  required  information  difficult. 
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In  medicine,  as  in  any  other  science,  a  definite  situation  is 
characterized  by  a  considerable  group  of  data.  This  leads  to  the  neces¬ 
sity  for  continuous  search  for  new  correlation  connections.  A  simple 
table  for  100  given  studies,  let  us  say,  of  100  patients,  with  the  aid 
of  which  we  can  determine  the  nature  of  a  few  combinations  of  charac¬ 
teristics  already  leads  to  definite  difficulties.  With  increase  of  the 
number  of  conditions  in  the  group  the  possibility  of  establishing  rela¬ 
tionships  is  practically  lost. 

Only  a  few  years  ago  the  narrow  specialization  of  scientists  aided 
in  the  solution  of  problems  of  data  reduction.  Actually,  In  some  small 
field  it  is  easier  to  find  the  information  of  Interest,  and  observa¬ 
tions  in  specialized  hospitals  of  a  homogeneous  group  of  patients  made 
it  possible  to  draw  certain  conclusions.  But  today  narrow  specializa¬ 
tion  offers  only  partial  aid,  since  attendant  illnesses  and  complica¬ 
tions  may  be  missed. 

An  essential  limitation  in  information  search  and  processing  is 
the  absence  of  a  unified  classification  of  diseases,  complications,  and 
patho-physiological  shifts  with  respect  to  the  various  organs  and  sys¬ 
tems.  This  hinders  the  exchange  of  Information  between  the  treatment 
institutions  and  prevents  an  improvement  of  the  precision  of  clinical 
medicine. 

Of  considerable  assistance  in  overcoming  these  difficulties  is  a 
new  branch  of  knowledge  which  has  been  termed  documental  information 
system  theory.  The  study  of  this  subject  has  become  a  science  in  it¬ 
self,  and  primary  attention  is  devoted  to  questions  of  creating  infor¬ 
mation  search  systems  -  IPS.  The  application  of  these  systems  in  clin¬ 
ics,  considering  the  possibility  of  subsequent  entry  of  the  data  into  a 
unified  medical  Information  center,  may  yield  results  in  the  very  near 
future  whose  Importance  it  is  difficult  to  overestimate. 
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First,  the  IPS  aid  in  overcoming  the  gap  between  the  outstanding 
and  small  clinics,  between  the  specialists  and  the  general  practition¬ 
ers,  which  is  the  result  of  the  possibility  of  using  the  experience  of 
all  the  treatment  institutions. 

Second,  we  can  overcome  a  considerable  portion  of  the  subjectivity 
associated  with  the  inadequate  qualification  of  doctors  in  evaluating 
the  information  obtained. 

Third,  the  IPS  aid  in  the  process  of  establishing  a  diagnosis, 
when  the  question  arises  of  what  minimal  amount  of  research  (which  the 
clinical  is  capable  of  conducting,  of  course)  must  be  carried  out  with 
respect  to  the  patient  so  that  the  diagnosis  will  be  most  reliable, 
while  the  studies  themselves  will  present  the  least  risk,  and  the 
technique  for  their  conduct  will  be  sufficiently  simple.  This  means  the 
practical  implementation  of  the  problem  which  has  long  faced  medicine  - 
a  maximum  of  data  with  a  minimum  of  analysis.  The  urgency  of  this  last 
question  does  not  require  any  comment  -  it  suffices  to  note  that  in  the 
clinics  tens  of  analyses  are  frequently  conducted,  and  a  correct  and 
reliable  diagnosis  is  still  not  obtained. 

There  are  many  versions  of  the  information  search  systems,  which 
in  principle  are  divided  into  the  following  4  types:  1)  puncbcard;  2) 
using  microfilms;  3)  based  on  the  use  of  electronic  computers;  4)  com¬ 
bined  systems. 

Most  widely  used  for  processing  comparatively  small  arrays  of  in¬ 
formation  are  the  punchcard  systems,  while  the  digital  computers  are 
used  for  processing  large  amounts  of  information.  We  shall  not  discuss 
the  history  of  creation  of  IPS,  we  simply  note  that  in  foreign  medical 
practice  systems  for  accumulating,  storing,  and  retrieval  of  informa¬ 
tion  are  widely  used.  In  the  Soviet  Union  the  information  search  system 
was  first  developed  at  the  A. A.  Vishnevskiy  Institute  of  Surgery  in 


Moscow.  Work  is  proceeding  on  the  design  of  clinical  IPS  in  Leningrad, 
Minsk,  and  other  cities. 

However ,  the  programs  for  finding  the  most  efficient  system  are 
being  carried  out  too  independently.  Therefore,  in  the  Thoracic  Surgery 
Clinic  of  the  Ukraine  Scientific  Research  Institute  for  Tuberculosis 
and  Chest  Surgery  work  is  being  done  on  developing  a  unified  plan  for 
studies  in  this  direction.  As  the  first  stage  it  is  proposed  that  the 
elements  be  developed  for  a  new  approach  to  the  evaluation  of  clinical 
information,  and  we  should  shift  over  to  this  approach  immediately. 

1.  STANDARDIZATION  OF  METHODS  OF  STUDYING  THE  PATIENT  AND  EVALUATING 
THEIR  RESULTS 

The  existing  techniques  for  conducting  clinical  examination  of  pa¬ 
tients  are  already  out  of  date.  Each  treatment  institution  performs 
these  studies  using  its  own  plan,  even  if  the  series  of  tests  are  the 
same  in  scope,  the  evaluation  of  their  results  frequently  differs. 
Without  even  discussing  the  various  forms  of  recording,  this  hinders 
the  comparison  of  data  obtained  by  the  clinics.  Closely  related  to  this 
situation  is  the  question  of  objectivization  of  the  methods  of  examina¬ 
tion  and  the  standardization  of  equipment. 

Thus,  a  unified  scheme  for  conducting  analyses,  a  generally  ac¬ 
cepted  gradation  of  the  results,  and  their  evaluation  have  become  an 
urgent  necessity. 

2.  CREATION  OF  UNIFIED  CLINICAL  CLASSIFICATIONS 

The  necessity  for  this  work  Is  a  result  of  the  fact  that  certain 
clinics  continue  to  divide  a  particular  illness  into  three  stages, 
others  use  four  stages,  a  third  group  may  not  classify  the  disease  at 
all. 
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In  addition,  at  the  present  there  is  no  strict  systematic  classi¬ 
fication  of  complications,  there  is  no  clarity  in  the  definition  of 
patho-physiological  shifts  with  respect  to  the  organs  and  systems  for 
different  diseases.  Finally,  there  are  no  criteria  for  evaluating  the 
efficiency  of  the  diagnostic  and  treatment  methods, 

3.  STANDARDIZATION  OF  MEDICAL  DOCUMENTATION 

Within  the  clinics  the  creation  of  unified  schemes  for  documenta¬ 
tion  is  going  forward  in  two  stages,  first,  transfer  to  punchcards  (or 
to  any  other  material  information  carrier)  of  the  data  of  periodic  lab¬ 
oratory  and  biochemical  analyses,  doctors'  notations,  etc.  This  is  the 
so-called  primary  documentation  (first  level  of  the  information  system! 

Second,  the  transfer  of  the  entire  volume  of  information  contained 
in  the  history  of  the  illness  in  a  compressed  historical  information 
form  (second  information  level).  An  example  of  the  organization  of  such 
an  information  system  will  be  given  below. 

Even  these  steps  alone  considerably  improve  the  degree  of  preci¬ 
sion  of  clinical  medicine,  particularly  in  specialized  hospitals  with 
a  large  number  of  similar  patients.  However,  in  the  unspecialized  or 
small  treatment  institutions  this  is  obviously  not  enough.  It  is  neces¬ 
sary  to  create  a  medical  information  center. 

Academician  V.M.  Glushkov  has  played  an  important  role  in  the  de¬ 
velopment  of  the  idea  of  creating  an  information  center. 

The  problems  here  are  the  following: 

1)  the  development  of  a  standardized  classification  of  diseases 
together  with  the  outstanding  specialists  and  scientific  societies; 

2)  the  creation  of  a  standardized  form  of  medical  documentation 
for  all  clinics,  convenient  for  transferring  the  information  to  the 
IPS: 
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3)  collection  and  storage  of  medical  information; 

4)  processing  data  from  the  literature  and  comparing  them  with  re¬ 
sults  obtained  in  the  clinics; 

5)  formulation  of  complex  diagnoses  and  selection  of  optimal 
treatment  methods; 

6)  conduct  of  statistical  and  correlation  calculations; 

7)  coordination  of  the  scientific  studies  associated  with  statis¬ 
tical  processing  of  clinical  material; 

8)  aid  the  treatment  institutions  in  introducing  the  new  methods 
of  diagnosis. 

We  picture  the  medical  information  center  as  an  organization  hav¬ 
ing  the  complex  structure  whose  schematic  may  be  as  shown  in  Pig.  1. 

All  the  information  collected  using  the  unified  scheme  is  fed  ei¬ 
ther  to  the  collection  section,  where  it  is  subjected  to  analysis,  or 
to  the  reference  section,  if  it  contains  several  inquiries.  From  the 
collection  section  the  data  enter  the  archives,  and  from  the  reference 
section  the  data  enter  the  computer  room.  In  the  memory  of  the  digital 
computer  there  are  stored  programs  for  retrieval  of  diagnostic,  treat¬ 
ment,  and  prognostic  information,  on  the  basis  of  which  a  plan  is  for¬ 
mulated  for  seeking  similar  cases. 

On  the  basis  of  the  materials  from  the  archives  of  disease  history 
and  the  literature  data,  the  digital  computers  provide  a  summary  of 
similar  cases  with  some  degree  of  approximation,  a  probabilistic  diag¬ 
nosis,  and  an  optimal  treatment  method.  However,  in  the  process  of  for¬ 
mulating  the  diagnosis  a  version  is  possible  in  which  considerable  ad¬ 
ditional  Information  is  required  for  a  more  precise  diagnosis.  Then  the 
request  for  this  additional  information  is  sent  back  to  the  treatment 
Institution.  The  procedure  is  similar  in  the  treatment  process,  when 
the  continuously  chaining  symptoms  lead  to  the  need  for  constant  and 
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continuous  refinement  of  the  treatment  method  and  require  a  determina¬ 
tion  of  the  possible  prognosis  of  the  patient’s  condition. 


Pig.  1.  Schematic  of  medical  information  center.  1)  Scientific  council; 
2)  libraries;  3)  literature  processing  section;  4)  reference  section; 

5)  clinic;  6)  section  for  developing  new  research  methods;  7)  program¬ 
ming  section;  8)  information  input  section;  9)  ETsVM;  10)  publication 
section;  11)  archives;  12)  computer  station  MSS. 

It  is  clear  that  the  information  material  will  ue  transmitted  over 
communication  channels  both  in  the  form  of  digital  data  and  in  the  form 
of  curves.  A  division  of  the  Institute  of  Cybernetics  of  the  Academy  of 
Sciences  of  the  likrainian  SSR  is  working  on  the  possibility  of  such 
transmission. 

In  the  archives,  where  information  from  the  clinic  and  literature 
data  enter,  use  must  be  made  of  the  latest  systems  for  accumulating, 
storing,  and  retrieving  the  required  information.  The  vast  system  memo¬ 
ry  is  the  result  of  the  large  amount  of  data,  while  the  necessity  for 
retrieval  of  Information  in  many  cases  in  the  course  of  a  short  time 
Interval  requires  quite  high  selection  speed  of  the  IPS.  Finally,  the 
frequent  referral  to  the  system  and  the  need  for  outputting  all  possi¬ 
ble  copies  of  the  originals  requires  the  use  of  copying  and  duplicating 
equipment  in  the  Information  center. 

The  second  part  of  the  archives,  intended  for  recording  and  accu- 
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mulatlng  literature  materials,  uses  the  data  from  the  abstracting  sec¬ 
tion.  The  latter  will  be  in  communication  with  all  the  libraries  and 
abstracting  centers  of  the  country  and  will  exchange  information  with 
foreign  institutes.  In  addition,  use  will  be  made  of  the  work  of  the 
scientific  researchers  cf  the  various  medical  institutions. 

The  medical  information  center  is  a  facility  in  which  a  large 
amount  of  factual  and  literature  material  is  accumulated.  Information 
concerning  the  latest  researches  and  treatment  methods  come  here  first. 
Finally,  the  center  has  varied  equipment  for  processing  data.  Therefore 
it  is  advisable  that  the  medical  information  center  nublish  communica¬ 
tions  based  on  the  considerable  clinical  and  literature  material. 

From  these  same  aspects  it  becomes  obvious  that  the  information 
center  must  offer  consulting  and  procedural  assistance  to  the  clinics 
in  developing  and  verifying  new  methods.  In  addition,  it  may  also  per¬ 
form  coordination  functions.  The  latter  is  particularly  important  in 
connection  with  the  fact  that  In  spite  of  all  the  methods  of  reporting 
the  number  of  duplicative  studies  (which  involve,  moreover,  a  tremen¬ 
dous  number  of  researchers^  Increases  every  year. 

The  formulation  of  programs  for  meuical  information  search,  the 
evaluation  of  new  methods  of  diagnostics  and  treatment  will  be  particu¬ 
larly  valuable  if  they  are  performed  by  specialists.  Therefore  the  par¬ 
ticipation  of  outstanding  scientists  in  the  information  center  is  de¬ 
sirable. 

The  principle  of  gradualness  must  be  observed  In  organizing  the 
Information  service.  First  small  information  search  centers  are  created 
within  the  clinics.  These  information  systems  will  first  be  concerned 
with  individual  nosological  units  or  groups  of  diseases.  At  this  stage 
of  the  work  we  can  make  an  evaluation  of  the  research  methods,  ieter- 
rr.ir.e  approximately  their  reliability,  and  consequently  identify  the 


most  important  and  definitive  methods. 

A  unified  information  center  is  created  on  the  basis  of  a  group  of 
such  systems.  As  the  second  step  there  must  be  statistical  processing 
of  the  characteristics  and  a  determination  of  the  different  versions  of 
the  course  of  the  illnesses.  This  makes  it  possible  to  find  a  clinical 
precedent  with  adequate  degree  of  accuracy. 

The  accumulation  of  clinical  data  permits  determining  for  the  sur¬ 
gical  clinics  inotructions  for  operations,  the  risk  of  operative  inter¬ 
vention,  and  the  condition  of  the  patient  in  the  near  and  distant  post¬ 
operative  period.  Later  on  it  will  be  pesoj-ole  to  formulate  a  disease 
program,  which  constitutes  the  third  stage. 

It  is  obvious  that  in  connection  with  changes  of  medical  view¬ 
points,  objectivization  of  the  research  data,  the  appearance  of  new  di¬ 
agnostic  methods  and  treatment  methods  a  periodic  review  of  the  ar¬ 
chives  will  be  required,  first  within  the  clinical  systems,  and  then  in 
the  medical  information  center.  Free  space  for  entering  modified  and 
additional  information  must  be  provided  for  this  purpose  on  the  infor¬ 
mation  carrying  material. 

As  an  example  let  ue  examine  t*e  construction  of  an  information 
search  system  implemented  In  the  previously  mentioned  Thoracic  Surgery 
Clinic.  Its  general  arrangement  is  planned  in  the  following  form  (Fig. 
2). 

The  primary  information  is  all  the  data  obtained  in  examining  the 
patient.  Tnis  information  is  fed  periodically  into  the  informa*. ion 
search  system.  For  example,  for  each  day  of  the  post-operatr/e  period 
we  propose  to  use  a  single  machine-sorting  punchcard  (the  first  level 
of  the  information  system  discussed  above).  With  the  aid  of  this  card 
catalog  we  will  be  able  in  the  future  tc  determine  the  versions  of  the 
course  of  several  illnesses.  After  release  of  the  patient  the  entire 
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history  of  the  sickness  will  be  entered  in  the  IPS,  from  which  data  are 
selected  for  the  second  level  of  the  system.  This  selection  is  per¬ 
formed  with  the  aid  of  specially  developed  epicrises.  They  record  the 
entire  complex  of  investigations  introduced  into  the  system  for  infor¬ 
mation  search,  information  groups,  and  indices.  The  groups  are  grada¬ 
tions  of  numerical  quantities.  In  the  process  of  their  compilation  at 
first  literature  information  is  used,  after  which  they  are  modified  us¬ 
ing  the  data  from  our  clinic  by  means  of  preliminary  statistical  proc¬ 
essing  of  ICO  histories  of  illnesses  and,  finally,  are  discussed  with 
the  leading  specialists.  The  information  indices  are  the  record  of  the 
columns  and  positions  of  the  machine  punchcards  corresponding  to  the 
given  characteristic. 
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Fig.  2.  a)  Level;  b)  reference  card  file;  c)  laboratory  card  file;  d) 
anaesthesiological  card  file;  e)  current  Information  card  file;  f)  lab¬ 
oratories;  g)  clinical  sections;  h)  anesthesiological  section. 

Thus,  in  the  epicrises  there  are  located  complete  data  on  the  pa¬ 
tient  and  the  corresponding  information  groups  are  recorded.  All  this 
permits  obtaining  the  required  information  from  the  epicrises  and  per¬ 
mits  the  verification  of  the  correctness  of  entry  of  the  data  onto  the 
punchcards.  The  information  indices  are  written  out  by  technical  per¬ 
sonnel. 

A  complex  of  computing  and  perforating  machinery  is  used  in  the 
clinic.  Its  application  is  Justified,  first,  by  the  frequency  of  refer- 
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ral  to  the  archives,  the  rapid  and  continuous  growth  of  the  archives; 
second,  by  the  necessity  for  conducting  computer  operations;  third,  by 
the  desire  to  then  introduce  data  from  the  machine  punchcards  into  the 
electronic  equipment.  At  the  present  time  archives  of  a  history  of  ill¬ 
nesses  involving  heart  defects  is  being  developed.  The  volume  of  infor¬ 
mation  from  each  history  exceeds  1500  characteristics.  They  are  ar¬ 
ranged  on  three  punchcard  models.  The  first  is  intended  for  recording 
the  data  of  the  pre-operational  period,  the  second  is  devoted  to  the 
operational  period,  while  the  third  records  the  characteristics  of  the 
course  of  the  illness  in  the  post-operative  period. 

The  punchcards  are  correlated  by  the  number  of  the  illness  history 
and  by  the  year  that  the  patient  entered  the  clinic,  and  are  differen¬ 
tiated,  first,  by  the  color,  which  makes  it  possible  to  separate  them 
visually,  and  second,  by  the  specially  coded  first  digit  of  the  illness 
history  number,  which  makes  it  possible  to  perform  machine  selection  of 
the  cards. 

On  each  punchcard  space  is  left  for  entering  auditional  informa¬ 
tion  which  is  the  result  of  studies  which  are  not  mandatory.  The  re¬ 
verse  side  serves  for  a  brief  record  of  unforeseen  characteristics  of 
the  course  of  the  illness.  Finally,  on  each  punchcard  there  are  record¬ 
ed  the  conclusions  from  the  two  other  cards  relating  to  the  given  pa¬ 
tient,  in  particular  the  diagnosis,  the  success  of  the  operation,  and 
its  immediate  results.  The  latter  makes  it  possible  to  perform  an  inde¬ 
pendent  processing  of  the  punchcards. 

Preference  is  given  to  direct  coding,  which  facilitates  decoding 
of  the  material  and  the  retrieval.  However,  in  several  cases,  for  exam¬ 
ple  in  the  part  of  the  second  punchcard  concerning  the  hemodynamics  of 
the  operational  period,  and  in  the  third  punchcard,  devoted  to  the 
changes  of  the  symptoms  in  the  post-operative  period,  use  is  made  of 
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the  combined  type  of  coding.  In  addition,  on  the  second  and  third 
punchcards  use  is  made  of  a  special  scheme  for  the  characteristic  of 
the  symptom  variations:  the  magnitude  of  a  characteristic  was  within 
definite  limits  in  the  post-operative  (operational)  period.  If  an  in¬ 
crease  (decrease)  is  noted,  a  remark  is  made  of  the  level  of  the 
change,  in  what  stage  it  occurred,  whether  it  occurred  once  or  several 
times,  and  for  how  long. 

The  anaesthesiological  records  will  also  be  entered  in  the  IPS  of 
our  clinic.  They  are  recorded  on  manual  punchcards,  a  prototype  of 
which  is  being  developed.  On  their  unused  field  there  will  be  located 
graphs  of  the  hemodynamic  variations,  and  along  the  borders  there  will 
be  information  on  the  use  of  medications,  the  nature  of  the  operation, 
characteristics  of  the  narcosis,  etc.  In  addition,  there  will  be  a  grid 
drawn  in  which  the  number  of  the  column  of  the  machine  punchcard  corre¬ 
sponding  to  the  recorded  characteristic  is  noted.  The  latter  is  done 
for  convenience  in  transferring  data  to  the  machine  punchcards. 

A  similar  card  file  is  also  being  created  for  the  clinical  labora¬ 
tories,  however  in  this  case  machine  punchcards  will  be  used. 

Finally,  we  note  that  each  punchcard  is  duplicated  so  that  one 
copy  is  located  in  the  sickness  history  records,  a  second  in  the  ar¬ 
chives,  and  the  third  in  a  spare  card  file. 

h.  CONCLUSIONS 

The  following  operations  may  be  performed  with  the  use  of  the  sys¬ 
tem  described  above. 

1.  Search  for  a  clinical  precedent.  The  soluti  >n  of  this  problem 
permits  at  the  very  first  stages  of  the  system  operation  the  determina¬ 
tion  of  the  risk  of  operative  intervention,  the  modification  of  a  diag¬ 
nosis,  and  evaluating  the  possible  severity  of  the  patient  condition  in 


the  post-operative  period. 

2.  Verify  the  reliability  of  several  methods  of  research. 

3.  Perform  statistical  and  correlation  calculations. 

4.  Determining  the  efficiency  of  treatment  methods. 

Naturally,  the  suggested  scheme  is  still  far  from  perfect,  the 

widest  possible  range  of  doctors  and  researchers  of  the  scientific  and 
technical  information  institutes  must  participate  in  its  development. 
However,  the  results  of  the  use  of  the  information  search  systems  in 
medical  practice  will  not  be  long  in  being  felt  and  even  at  the  very 
first  stages  they  will  aid  in  improving  the  precision  of  clinical  medi¬ 
cine. 
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[Transliterated  Symbols] 

HITC  «  IPS  «  informatsionno-poiskovaya  sistema  ■  information 
search  system 

3UBM  »  ETsVM  ■  elektronnaya  tsifrovaya  vychislltel'naya 
mashina  «  electronic  digital  computer 
MCC  ■  MSS  ■  mashinna-schetnaya  stantsiya  -  computer  station 
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SOLUTION  OP  SOME  INFORMATION  LOGIC  PROBLEMS 
OP  GENERAL  CHEMISTRY  ON  ELECTRONIC  DIGITAL  COMPUTERS 

A.L.  Seyfer 

In  the  present  paper  we  shall  consider  four  basic  trends  in  the 
creation  of  Information  logic  systems  for  general  chemistry.  These  are 
the  automatic  information  search  for  chemical  compounds  on  the  basis  of 
their  physical  and  chemical  properties;  the  accumulation,  processing, 
and  retrieval  of  information  using  composition-property  diagrams;  no¬ 
menclature  automatic  translation;  and  the  information  logic  methods  of 
calculating  properties  of  compounds. 

We  shall  describe  primarily  the  investigations  made  at  the  Labora¬ 
tory  of  Electi’osimulation  of  the  Institute  of  Scientific  Information  of 
the  Academy  of  Sciences  USSR.  An  extensive  review  of  the  present  state 
of  the  question  is  presented  in  [1], 

I.  AUTOMATIC  INFORMATION  SEARCH  FOR  CHEMICAL  COMPOUNDS  USING  THEIR 
PHYSICAL  AND  CHEMICAL  PROPERTIES  [2] 

In  this  section  we  shall  consider  very  briefly  the  experience  in 
use  of  electronic  digital  (computing  and  informational)  machines  for 
creating  a  machine  information  search  system  using  the  physical  and 
chemical  properties  of  individual  (primarily  nonorganic)  compounds. 
Sometimes  this  system  is  termed  an  "electronic  handbook."  The  creation 
of  such  systems  is  becoming  more  urgent  with  every  year  which  is  asso¬ 
ciated,  naturally,  with  the  very  rapid  growth  of  chemical  information. 

Generally  speaking,  each  concrete  or  even  potentially  possible 
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(even  including  all  chemistry)  array  of  chemical  information  may  be 
considered  from  the  formal  point  of  view  as  some  universal  class  V.  The 
elements  of  this  class  will  be  all  properties  of  all  chemical  compounds 
and  any  combinations  of  them  (solutions,  melts,  reactions,  etc.),  and 
also  all  conceivable  characteristic  conditions  of  determining  the  prop¬ 
erties  of  each  chemical  object.  The  formulas  of  the  chemical  compounds, 
and  also  their  systematic  and  trivial  names  and  composition-property 
diagrams  for  the  systems  of  chemical  compounds  are  also  elements  of  the 
universal  class,  just  as  al.l  the  rest.  However,  the  obvious  specific 
nature  of  such  elements  forces  us  to  consider  them  separately.  This 
applies  most  of  all  to  the  structural  formulas  of  the  chemical  (primar¬ 
ily  organic)  compounds,  for  which  the  methods  of  machine  treatment  are 
described  in  detail  in  [3].  The  questions  of  nomenclature  translation, 
and  also  the  machine  recording  and  processing  of  the  composition-prop¬ 
erty  diagrams  will  be  discussed  in  the  corresponding  sections  of  the 
present  paper.  Here  we  shall  deal  with  the  subclass  V',  including  in 
itself  all  those  elements  of  the  universal  class  which  relate  only  to 
the  properties  of  the  individual  chemical  compounds  and  the  character¬ 
istic  conditions  for  determining  these  properties. 

The  properties  of  the  inorganic  compounds  are  collected  in  the 
electronic  handbook  described  below.  Here  we  shall  arbitrarily  consider 
inorganic  compounds  to  be  those  which  have  in  their  composition  no  more 
than  two  carbon  atoms  in  a  single  radical  or  ligand.  Of  all  the  classes 
of  chemical  compounds  it  is  primarily  these  inorganic  compounds  (and  of 
the  organic  compounds  only  the  hydrocarbons)  which  have  been  studied 
relatively  completely.  Therefore  in  the  general  mass  of  chemical  com¬ 
pounds  they  stand  out  by  the  relative  completeness  of  the  collection  of 
properties  which  have  been  studied.  This,  incidentally,  to  a  consider¬ 
able  degree  stimulated  the  selection  of  these  compounds  as  the  objects 
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for  the  information  search  system. 

In  describing  substances,  modern  chemistry  uses  approximately  300 
physico-chemical  properties.  We  have  selected  120  of  the  most  important 
properties:  thermochemical  (heat  and  temperature  of  phase  transitions, 
formation  energy  and  entropy,  etc.),  thermophysical  (vapor  pressure, 
solubility,  density,  viscosity-temperature  relations,  etc.),  structural 
(bond  lengths  and  angles  between  them),  optical  (refraction,  polariza¬ 
tion,  etc.),  magnetic,  electric  (dipole  moments,  conductivities,  etc.), 
etc.  It  should  be  noted  that  the  set  of  properties  may  be  expanded  not 
only  by  using  the  physical  and  chemical  constants,  but  also  using  in¬ 
formation  on  the  adaptability,  biological  activity,  crystalline  struc¬ 
ture  of  a  particular  compound,  etc. 

From  the  formal  point  of  view  all  the  constants  may  be  divided  in¬ 
to  three  types: 

a)  binary  criteria  ("yes”  -  "no"),  such  as  "does  a  given  substance 
dissolve  in  benzene,"  etc.; 

b)  numerical  characteristics,  i.e.,  the  intrinsic  values  of  par¬ 
ticular  constants; 

c)  the  coefficients  of  the  equations  describing  the  variation  of 
the  magnitude  of  some  property  with  variation  of  the  conditions,  for 
example  the  coefficients  of  the  equations  of  the  type 

Cf  »  a,  +  *i7*  + 

or 

•fiP  *  A  +  TTT  • 

where  C ’  is  the  heat  capacity  at  a  given  temperature;  P  is  the  vapor 
pressure  of  the  compound;  t  and  r  are  the  temperatures  In  degrees  C 
and  K;  4,  8,  Ct  vQt  o^,  are  empirical  coefficients. 

As  a  rule,  for  the  constants  of  the  type  "a"  and  "b"  there  are  not 
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always  standard  characteristic  conditions  of  definition  and  therefore 
in  many  cases  these  constants  must  be  specifically  specified,  i.e.,  to¬ 
gether  with  the  magnitude  of  the  property  we  indicate  the  conditions 
under  which  it  was  determined.  Each  formal  type  of  constant  requires 
its  own  methods  of  recording  and  its  methods  of  processing  during 
search.  We  should  not  one  specific  characteristic  of  the  formulation 
and  organization  of  an  information  array  on  the  physical  and  chemical 
properties  of  compounds  which  is  associated  with  the  fact  that  for 
many  (particularly  the  more  simple  and  popular)  compounds  in  the  liter¬ 
ature  we  encounter  several  values  of  a  particular  constant  which  differ 
from  one  another.  Therefore,  not  taking  the  responsibility  for  the  se¬ 
lection  of  a  particular  so-called  recommended  value  of  the  constant,  we 
must  present  all  the  values,  specifying  each  time  the  literature  source. 

1.  Forms  of  Search  Strategy 

In  the  automatic  information  search  system  for  the  inorganic  com¬ 
pounds  there  are  defined  three  types  of  questions  which  may  be  speci¬ 
fied  or,  as  is  more  frequently  said,  three  types  of  search  strategy. 

1.  Find  in  the  machine  memory  and  printout  one  or  several  definite 
properties  of  a  specific  compound. 

2.  Find  in  the  machine  memory  all  the  properties  of  a  given  com¬ 
pound  and  print  them  out.  In  this  case,  if  for  a  given  compound  there 
are  also  recorded  the  coefficients  of  an  equation,  it  is  necessary  to 
calculate  the  values  of  the  property  defined  by  this  equation  with  a 
given  step  of  variation  of  the  argument  and  print  out  the  entire  table 
or  a  portion  of  the  table.  In  the  case  when  certain  constants  have 
somewhat  differing  values.  It  is  necessary  to  print  out  all  these  val¬ 
ues  with  an  indication  of  the  literature  sources. 

3.  Find  in  the  machine  memory  and  print  out  all  the  compounds  hav- 
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ing  a  certain  specified  set  of  properties.  For  example,  all  the  com¬ 
pounds  containing  sulfur  but  not  containing  chlorine,  having  a  dipole 
moment  which  is  nonzero,  melting  in  the  limits  150-l80°C,  boiling  no 
higher  than  270°C,  etc.  Such  a  type  of  search  strategy  is  most  suitable 
for  the  practical  problems  of  selecting  a  material  with  given  proper¬ 
ties. 

2.  Organization  of  the  Information  Array  in  the  Machine  Memory 

In  spite  of  the  fact  that  the  inorganic  compounds  have  been  stud¬ 
ied  more  than  the  others,  only  7-8*  of  all  the  possible  constants  or 
the  entire  array  are  available.  The  remaining  92-93X  constitute  the  so- 
called  void  constants.  The  latter  are  explained  primarily  by  the  still 
inadequate  state  of  Investigation  cf  particular  compounds  and,  to  a 
lesser  degree,  by  the  inapplicability  of  a  given  type  of  constant  to  a 
given  compound  (for  example,  the  index  of  refraction  of  an  aqueous  so¬ 
lution  of  a  compound  which  is  not  soluble  in  water,  etc.). 

Therefore,  the  simplest,  tabular  system  of  recording  the  constant^ 
where  each  property  of  each  compound  is  assigned  its  own  memory  cell, 
is  not  really  applicable.  In  this  case  the  required  memory  volume  must 
be  increased  by  a  factor  of  12-13*  which  naturally  leads  to  approxi¬ 
mately  the  same  increase  of  the  search  time  .  This  last  situation  is  as¬ 
sociated  with  the  fact,  as  will  be  shown  below,  that  the  processing 
time  for  any  information  subarray  in  the  operational  machine  memory  is 
incomparably  less  than  the  time  for  entry  into  this  operational  memory 
of  the  succeeding  subarray  from  tue  external  memory  units:  magnetic 
tape,  punched  cards,  etc.  However,  the  increase  of  the  information  vol¬ 
ume  leads  to  a  nearly  directly  proportional  increase  cf  the  total  time 
for  input  of  the  information  from  the  external  memory  units. 

Naturally,  the  primary  problems  of  organization  of  the  information 
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Fig.  1.  1)  iV  compounds;  2) 
"length”  of  the  compound;  3) 
scale;  4)  binary  characteristic; 
5)  end-of-compound  mark;  6)  end- 
of~zone  mark;  7)  end-of-array 
mark. 


array  in  the  machine  memory  are  the  problems  of  information  compres¬ 
sion,  the  struggle  against  the  previously  mentioned  void  constants. 

When  operating  with  the  universal  computer  which  was  first  utilized  the 
information  array  was  organized  so  that  a  separate  section  of  the  ma¬ 
chine  memory  was  assigned  to  a  single  compound.  In  places  1-12  of  cell 
k  there  is  recorded  the  number  cf  the  compound,  and  in  places  13-24  of 
this  same  cell  there  is  entered  a  number  equal  to  the  number  of  cells 
assigned  to  the  given  compound.  The  cells  k  ♦  1  and  tc  ♦  2  are  assigned 
to  the  scales.  A  "1"  in  a  definite  in  a  definite  positiou  ot  the  scale 
cell  indicates  that  a  property  value  is  recorded  for  the  given  com¬ 
pound.  The  number  of  the  place  in  the  first  scale  cell  or  the  number  of 
the  place  it.  the  second  scale  cell  plus  24  (24  is  the  machine  word 
length)  corresponds  to  the  number  of  the  property  in  the  previously 
fixed  list  of  38  properties.  The  last  10  places  of  the  k  ♦  2  cell  are 
assigned  to  the  binary  characteristics  (is  the  compound  soluble  in 

C6H6*  cpH50H»  CC14»  oRd  so  on)‘ 

The  values  of  the  pt  slco-chemlcal  properties  are  stored  in  cells 


w, 
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<c  ♦  3»  *  +  4,  . . . ,  ic  +  n.  In  this  case,  if  the  value  of  some  property 
is  missing  in  the  array,  the  cell  does  not  remain  empty,  but  in  it 
there  is  stored  the  value  of  the  following  property,  etc.  Thus,  if  in 
the  scale  cells  there  are  ones  only,  for  example,  in  2,  6,  16,  24,  31 
(7  +  24)  positions,  then  the  values  of  the  properties  corresponding  to 


these  positions  are  recorded  in  sequence  in  the  same  order  and  occupy 
only  5  cells  in  place  of  38  for  the  case  of  the  tabular  form.  The  cells 
ie  +  n  +  1,  if  +  »  +  2,  ic  +  n  +  3  are  assigned  for  the  marks  "end  of  com¬ 
pound,"  "end  of  zone*  (there  are  3-4  compounds  in  a  zone),  and  "end  of 
entire  array." 

In  the  Ural-4  machine  the  information  array  concerning  the  physi¬ 
co-chemical  properties  of  the  inorganic  compounds  Is  organized  somewhat 
differently.  The  long  machine  word  length  permitted  replacing  the 
scales  by  a  peek-a-boo  matrix.  The  latter  is  a  line-by-line  simulation 
of  superposition  punched  cards  T4].  The  structure  of  the  peek-a-boo  ma¬ 
trix  is  shown  in  Fig.  2.  Here  a  single  definite  compound  is  placed  in 
correspondence  to  each  identical  position  of  a  group  cf  cells  (column). 
Each  cell  (matrix  row)  corresponds  to  a  single  definite  property.  Thus, 
in  a  single  subarray,  "packet,"  there  are  gathered  40  compounds.  The 
number  of  properties  in  the  set  is  not  fixed  ahead  of  time.  A  portion 
of  the  matrix  cells  Is  assigned  for  the  binary  characteristics.  Ones  in 
a  definite  position  of  any  cell  indicate  the  presence  in  the  memory  of 
a  value  of  the  property  p.  of  the  compound  a..  The  values  themselves 
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of  the  properties  are  recorded  In  the  same  fashion  as  In  the  case  when 
the  scales  are  used.  First  the  constants  of  the  first  compound  are  re¬ 
corded  sequentially,  then  the  second,  etc.  After  the  values  of  the 
properties  in  the  individual  cells  there  are  written  the  addresses  of 
the  subarrays  with  the  equation  coefficients.  At  the  end  of  the  array 
there  are  written,  symmetric  to  the  values  of  the  constants,  references 
to  the  source  from  which  this  constant  was  taken,  and  after  all  this 
there  are  written  the  addresses  of  the  so-called  duplicates  of  the  con¬ 
stants,  i.e.,  the  second,  third,  ...  values  of  certain  constants  (with 
their  literary  references),  for  which  the  first  values  are  presented  in 
the  basic  part  of  the  subarray. 

TABLE  1 
TeO 

0OI+2-M71O0O  I  1069 
006+2+365000  10297 

Since  on  the  Ural-1!  machine  only  a  single  packet  is  stored  on  a 
single  zone  of  the  magnetic  tape,  there  is  no  need  to  indicate  the "num¬ 
ber  of  the  cells  assigned  to  a  compound  or  to  specially  number  the  com¬ 
pounds.  The  number  of  the  compound  is  automatically  determined  by  the 
number  of  the  packet,  and  within  the  packet  it  is  determined  by  its 
place  in  the  packet.  On  the  average  a  single  packet  (40  compounds)  oc¬ 
cupies  approximately  500-700  cells. 

We  use  the  method  of  preparing  the  array  with  storage.  It  is  in¬ 
tended  for  preparation  of  information  for  a  set  of  alphanumeric  per¬ 
forators  with  the  objective  of  subsequent  input  into  the  Ural-4  ma¬ 
chine.  As  the  information  arrives  it  is  entered  on  cards  for  perforat¬ 
ing.  A  sample  of  one  card  is  shown  in  Table  1.  On  it  there  are  written 
out  and  coded  the  formation  energies  for  gaseous  (property  001)  and 
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liquid  (property  006)  states  of  the  compound  TeO  together  with  the  lit¬ 
erature  citations  (1069  and  10297).  These  cards  are  accumulated  con¬ 
stantly,  independent  of  the  perforation  process.  During  perforation  on¬ 
ly  a  single  property  of  a  single  compound  is  entered  on  each  punchcard. 

Before  entry  into  the  Ural-4  machine,  a  selection  is  made  on  a 
conventional  sorter  of  all  the  punchcards  relating  to  a  single  compound. 
The  array  which  has  been  subsorted  in  this  fashion  can,  in  addition  to 
its  basic  use  for  input  to  the  Ural-4,  pass  through  the  printer  unit  of 
a  tabulator  and  obtain  a  summary  of  the  physicochemical  properties  of 
all  the  compounds  accumulated  over  some  definite  time  period. 


Pig.  3.  The  Ge-Si  system  and 
its  physico-chemical  graph.  1) 
at. 


The  translation  of  the  array  from  the  input  language  to  the  ma¬ 
chine  language  is  accomplished  automatically  by  specialized  translation 
programs.  The  basic  idea  in  these  programs  is  the  conversion  of  the 
punctuation  form  in  the  input  languages  to  a  position  code  and  primari¬ 
ly  the  formation  of  scales  or  a  peek-a-boo  matrix.  The  description  of 
the  translation  program  is  the  subject  of  a  separate  study. 

3.  Search 

The  search  programs  are  formulated  so  that,  depending  on  the  in¬ 
terrogation,  i.e.,  on  the  type  of  search  strategy,  particular  blocks  of 
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the  unified  search  program  are  activated.  The  blocks  of  this  program 
arc  the  following:  processing  of  the  peek-a-boo  matrix,  scale  shift, 
comparison  of  the  value  of  a  property  with  the  limiting  values  speci¬ 
fied  by  thet question,  and  calculation  using  the  equations. 

The  block  for  matrix  processing  operates  with  a  series  of  ques¬ 
tions,  each  of  which  is  essentially  the  disjunctive  normal  form  of  the 
binary  characteristics  (property  numbers).  The  result  of  the  block  op¬ 
eration  is  a  table  of  scales  —  TO,  each  scale  of  which  contains  the 
numbers  of  the  compounds  in  a  packet  which  have  the  given  property  (re¬ 
gardless  of  its  numerical  value). 

The  second  block  is  intended  for  finding  in  the  information  array 
the  required  numerical  values  of  the  constants  from  the  address  gener¬ 
ated  by  the  preceding  block.  The  third  block  compares  these  numerical 
values  with  the  limiting  values  specified  by  the  question  and,  finally, 
the  last,  fourth  block  calculates  the  values  of  the  constants  specified 
by  the  equation  coefficients.  Activation  of  a  particular  block  is  ac¬ 
complished  by  a  control  subprogram  depending  on  the  type  of  search  in¬ 
struction. 

Experimental  operation  of  the  search  program  on  a  machine  with  the 
scale  form  of  information  array  for  3-^  compounds  in  one  zone  gave  the 
following  results.  Using  the  first  and  second  types  of  search  instruc¬ 
tion  about  2  min  is  necessary  for  an  answer  to  a  single  question  (not 
counting  the  time  for  input  of  the  question  itself).  This  time  is  made 
up  of  the  zone  search  time  -  about  80-90  sec,  the  search  in  the  zone  - 
3  sec,  and  the  time  for  printing  the  result  -  30-^0  sec.  Using  the 
third  type  of  search  instruction  for  each  zone  the  times  were:  20-25 
sec  for  entry  of  the  next  zone,  6-8  sec  for  searching  in  the  zone,  and 
5-10  sec  for  printing  the  result  of  the  search  for  a  given  zone.  Thus, 
in  all  the  processing  of  a  single  zone  occupies  31-^3  sec,  which  for 
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all  the  material  of  inorganic  chemistry  would  give  about  30  hours.  Nat¬ 
urally,  this  length  of  time  for  a  continuous  scan  of  the  entire  array 
is  excessively  long  and  makes  the  entire  system  of  little  use  in  prac¬ 
tice. 

However,  the  use  of  the  peek-a-boo  matrix  on  the  Ural-4  machine 
permitted  a  significant  reduction  of  the  search  time.  Thus,  for  the 
third  type  of  search  instruction  ("find  all  the  compounds  having  a  def¬ 
inite  set  of  properties")  complete  processing  of  a  single  packet  to¬ 
gether  with  the  time  necessary  for  entry  into  the  operational  memory  of 
the  succeeding  packet  of  40  compounds  does  net  exceed  5-6  sec.  A  con¬ 
tinuous  sort  of  the  entire  array  and  the  search  in  the  array  for  the 
required  compounds  for  all  inorganic  chemistry  is  accomplished  in  20-25 
min.  However  the  search  using  the  first  two  types  of  search  instruc¬ 
tions  lasts  20-50  sec  on  the  Ural-4  machine. 

Here  we  should  note  that  using  the  described  scheme  we  can  con¬ 
struct  automatic  search  systems  for  other  than  the  chemical  compounds. 
We  can  assume  that  the  metals  and  their  alloys,  biological,  economic- 
statistical,  and  other  objects  may  serve  as  input  information  for  such 
a  system.  An  attempt  has  recently  been  made  to  use  an  electronic  hand¬ 
book  for  creating  an  information  search  system  for  the  metals  and  al¬ 
loys  [5]. 

II.  INFORMATION  STORAGE,  PROCESSING,  AND  RETRIEVAL  USING  THE  COMPOSI¬ 
TION-PROPERTY  DIAGRAMS 

A  somewhat  unique  field  of  application  of  the  information-logic 
methods  of  chemistry  is  the  information  concerning  systems  of  substanc¬ 
es,  i.e.,  the  field  which  is  studied  by  physico-chemical  analysis.  Mod¬ 
ern  chemistry  deals  ever  more  frequently  not  only  with  the  individual 
chemical  compounds,  but  also  with  their  combinations  -  systems.  Alloys 
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of  the  metals  and  salts,  solutions,  etc.,  are  very  widely  used  by  engi¬ 
neering,  which  has  led  to  the  need  for  their  systematic  study  by  the 
methods  of  physico-chemical  analysis.  This  in  turn  has  given  rise  to 
and  Is  continuing  to  give  rise  to  the  accumulation  of  data  concerning 
the  equilibrium  chemical  systems  in  ever  increasing  scale. 

Just  as  in  certain  other  branches  of  chemistry,  physico-chemical 
analysis  has  its  international  language  —  the  geometrized  language  of 
the  composition-property  diagrams.  The  composition-property  diagram  it¬ 
self,  according  to  the  definition  of  N.S.  Kurnakov,  "is  a  graphical 
representation  of  that  complex  function  which  defines  the  ratio  be¬ 
tween  the  composition  and  the  properties  of  homogeneous  bodies  or 
phases  which  are  formed  in  a  system"  [6J.  Figures  3-7  show  several 
specimens  of  such  diagrams.  The  accepted  technique  of  representing  in 
the  form  of  certain  geometric  patterns  the  variation  of  some  physical 
property  as  a  function  of  the  physico-chemical  interactions  in  a  system 
as  a  function  of  the  variation  of  the  composition  has  led  to  a  tendency 
to  treat  certain  typical  topological  structures  on  the  diagrams  as  one- 
to-one  representations  of  the  physico-chemical  transformations  which 
take  place  in  the  systems.  Seemingly,  such  a  tendency  is  quite  fruitful 
if  only  because  it  is  essentially  an  attempt  at  some  standardization 
and  formalization  of  all  the  accumulated  material.  However,  such  stand¬ 
ardization  and  formalization,  unfortunately,  is  not  all-encompassing, 
since  in  actuality  there  is  not  a  complete  one-to-one  correspondence 
between  the  physico-chemical  transformations  in  the  system  and  their 
"typical"  geometrical  representations.  This  implies  that  the  geometri¬ 
cal  structure  of  the  diagram  in  itself  cannot  be  considered  as  complete 
information  concerning  the  system.  Therefore,  we  ’  -t  consider  all 
three  aspects  of  the  system  (and  the  diagram  which  represents  it)  as 
Inseparably  related:  the  topological,  metrical,  and  chemlsm  (chemical 
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essence)  aspects.  Consequently,  the  machine  language  of  physico-chemi¬ 
cal  analysis  must  take  Into  account  these  three  aspects  of  the  form  of 
the  Information  concerning  the  equilibrium  systems  and  at  the  same  time 
with  respect  to  its  capabilities  must  be  equivalent  tc  the  real  larg- 
uage  of  physico-chemical  analysis.  By  machine  language  of  the  physico¬ 
chemical  analysis,  we  mean  here  a  definite  system  of  recording  in  the 
memory  and  logical  processing  by  the  machine  of  the  experimental  and 
theoretical  information  with  the  objective  of  solving  information  prob¬ 
lems.  In  the  real  language  of  physico-chemical  analysis  the  following 
typical  information  problems  are  solved. 


Pig.  iJ.  The  levo-carroxime/ 
dextro-carroxime  system  and  its 
physico-chemical  graph.  1)  mol. 


Pig.  5.  The  Cu-Au  system  and  its 
physico-chemical  graph.  1)  at. 

1.  Finding  a  concrete  composition-property  diagram  (for  example, 
the  diagram  of  the  states  of  the  Sl-Ge  system  or  the  electrical  conduc¬ 
tivity-composition  diagram  of  the  system  HjSO^-HgO). 

2.  Finding  all  the  diagrams  for  a  given  system  of  substances  (for 
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Pig.  6.  Diagram 
of  the  condensed 
state  of  a  binary 
system  with  com¬ 
plete  solubility 
in  the  liquid 
state  and  com¬ 
plete  insolubil¬ 
ity  in  the  solid 
state,  with  the 
formation  of  an 
incongruently 
melting  chemical 
compound.  1)  Zh. 


t  a  a 


Pig.  7.  Diagram 
of  condensed 
state  of  a  binary 
system  with  com¬ 
plete  solubility 
in  the  liquid 
state  and  complete 
insolubility  in 
the  solid  state, 
with  the  forma¬ 
tion  of  a  chemi¬ 
cal  compound 
which  is  not  dis¬ 
sociated  in  the 
liquid  state.  1) 
Zh. 


example,  all  the  composition-property  diagrams:  density,  electrical 
conductivity,  saturated  vapor  pressure,  refraction,  etc.,  for  the  sys¬ 
tem  K4[Pe(CN)6]-K20). 

3.  Finding  the  diagrams  for  systems  in  which  one  or  more  of  the 
components  are  in  the  general  class  of  the  chemical  compounds  (for  ex¬ 
ample,  all  the  state  diagrams  for  alloys  of  the  Mg  chlorides  with  the 
chlorides  of  the  alkali  metals  or  the  composition-electrical  conductiv¬ 
ity  diagrams  for  the  systems  HgO-normal 

4.  Identify  systems  with  a  common  structure  for  the  entire  diagram 
or  for  individual  segments  of  the  diagram  (for  example,  liquid  systems 
with  a  maximum  of  the  vapor  pressure  or  condensed  systems  with  extrema 
of  the  initial  melting  temperatures). 

5.  Determine  the  property  magnitude  for  a  given  composition  for 
one  or  more  systems  of  substances  (for  example,  the  crystallisation 
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temperature  of  all  alloys  containing  27  at.  %  Zn  or  the  melting  points 
of  the  alloy  82$  Pe  and  l&wt.  %  Ni,  etc.). 

6.  Find  the  entire  combination  of  substances  with  definite  proper¬ 
ties  or  with  a  definite  combination  of  properties  (for  example,  the 
composition  by  components  of  mixtures  of  all  salts  melting  above  320° C 
without  decomposition  and  having  a  density  higher  than  1.7  g/cm^,  etc.). 

These  same  information  problems  must  also  be  solved  in  the  devel¬ 
oped  machine  language  of  physico-chemical  analysis.  Let  us  consider  the 
three  forms  of  description  in  machine  language  which  correspond  to  the 
three  aspect?'  of  the  real  language  of  physico-chemical  analysis.  The 
description  principles  described  below  have  been  developed  basically 
for  the  simplest  and  at  the  same  time  most  frequently  encountered  case 
of  binary  systems.  The  majority  of  these  principles  are  also  applicable 
to  the  description  of  three-component  and  multi-component  systems. 

Topological  description.  N.S.  Kurnakov  introduced  into  physico¬ 
chemical  analysis  the  topological  representations,  i.e.,  the  considera¬ 
tion  of  the  composition-property  diagrams  as  a  closed  topological  com¬ 
plex.  In  describing  topological  structures  by  binary  composition-prop¬ 
erty  diagrams  it  is  necessary  to  take  into  account  the  basic  character¬ 
istics  of  the  contemporary  electronic  machines  for  information  process¬ 
ing  -  they  all  require  that  the  information  be  represented  in  linear¬ 
ized  form  for  input,  i.e.,  that  the  information  consist's  of  a  linear 
sequence  of  symbols  in  machine  code.  It  is  obvious  that  the  composi¬ 
tion-property  diagrams  for  the  binary  and  particularly  for  the  three- 
component  and  multi-component  systems  are  essentially  nonlinear.  Thus, 
any  diagram  of  a  binary  system  regardless  of  its  chemical  significance 
la  a  two-dimensional  closed  complex,  composed  of  points,  lines,  and 
surfaces  (fields).  Because  of  the  nonhcmogenelty  of  such  an  initial 
complex  its  representation  in  linearised  form  is  associated  with  over- 


coming  certain  difficulties.  When  separating  from  this  complex  a  refer¬ 
ence  graph  consisting  only  of  lines  (edges  of  the  graph)  and  points 
(vertices)!  Information  will  be  lest  concerning  the  fields  and  their 
relative  positions  In  the  diagram  since  for  such  a  graph  several  dif¬ 
ferent  plane  realizations  are  acceptable. 

For  the  two-dimensional  topological  complexes,  these  difficulties 
(In  particular,  for  the  binary  system  diagrams)  may  be  avoided  by  using 
the  duality  principle  and  introducing  "coloration”  of  the  vertices,  and 
also  special  "boundary-vertices.”  The  graph  constructed  with  the  use  of 
these  techniques  and  uniquely  describing  the  composition-property  dia¬ 
gram  for  the  binary  systems  has  been  termed  a  physico-chemical  diagram 
[7],  The  sequential  restructuring  of  the  original  graph  into  the  physi¬ 
co-chemical  graph  may  be  followed  in  Fig.  8,  where  as  the  original  com¬ 
plex  we  have  taken  a  typical  diagram  of  a  system  with  a  eutectic  point. 
First  we  construct  a  graph  which  Is  dual  to  the  original.  To  do  this  a 
point  of  the  field-vertex  of  the  dual  graph  A  (Fig.  8,  A)  is  correlated 
with  each  field  of  the  diagram.  The  entire  region  lying  outside  the 
closed  comple::  forms  an  empty  field,  to  which  the  empty  vertex  A  of  the 
graph  is  correlated.  The  graph  A  thus  obtained  describes  only  the  rela¬ 
tive  positioning  of  the  fields  in  the  original  diagram.  After  con¬ 
structing  the  dual  graph  each  intersection  of  the  edges  of  this  graph 
with  an  edge  of  the  original  graph  is  replaced  by  a  point,  a  "boundary- 
vertex."  In  the  newly  formed  graph  B  (Pig.  8,  B)  the  boundary-vertices 
are  located  between  the  field-vertices.  For  each  field  of  the  diagram 
their  number  is  equal  to  the  number  of  lines  separating  the  given  field 
from  the  others.  Therefore,  for  example,  the  field  lying  below  the  sol¬ 
idus  line  in  Pig.  8  is  described  by  the  single  field  vertex  8  and  by 

five  boundary-vertices  a,  which  causes  the  appearance  of  new  edges  in 

* 

the  graph.  By  definition  the  vertices  of  type  a  will  always  be  vertloes 
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of  only  second  order*  two-ray.  The  typi  $  vertices  will  have  an  order 
no  less  than  2,  part  of  them  will  be  nodes*  i.e.,  vertices  with  order 
>.3*  Thus*  In  the  graph  B  there  are  vertices  of  two  types*  two  colors. 

In  graph  theory  such  constructions  are  normally  termed  graphs  with  la¬ 
beled  vertices.  In  certain  cases*  for  the  description  It  Is  convenient 
to  color  each  vertex  In  Its  own  color,  which  permits  In  a  single  repre¬ 
sentation  having  information  concerning  the  general  structure  of  the 
diagram  and  concerning  its  concrete  chemical  content.  In  this  case  the 
vertices  of  type  a*  which  are  formed  from  a  single  line  of  the  original 
diagram*  always  have  the  same  color*  as  shown  In  Fig.  8*  C. 


A  B  C 


Fig.  8.  Sequence  of  construction  of  a  physico-chemical  graph.  A)  Graph 
which  is  the  dual  of  the  original;  B)  graph  with  vertices  of  two  col¬ 
ors;  C)  physico-chemical  graph. 

There  are  several  methods  of  linear  representation  of  graphs.  We 
have  used  one  of  the  simplest  -  the  construction  of  a  star  table. 

Let  a^*  o^*  a#,  .  ,.*  be  the  boundary-vertices,  and  8^*  8t, 

8a  the  fie1 d-vert Ices  o-  a  physico-chemical  graph.  Then  each  row  of  the 
star  table  appears  as  follows: 

*.*»«8*-  •  •  . 

I.e.*  following  each  symbol  of  any  field-vertex  there  are  written  suc¬ 
cessively  all  the  branches  of  the  graph  emanating  from  this  vertex. 

Bach  branch  consists  of  two  vertices:  the  nearest  vertex  is  type  8,  and 
that  lying  between  two  such  vertices  is  type  a.  Essentially  each  row  of 
the  table  is  the  symbolic  representation  of  one  deflnr  t  field  of  the 
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diagram  with  an  indication  of  the  neighboring  field  and  the  bounding 
lines.  The  complete  set  of  rows  for  all  vertices  of  the  physico-chemi¬ 
cal  graph  type  may  be  written  linearly.  For  example, 
diagram  without  extrema  of  the  melting  points: 

I  Pt — 3jpi.i4-\,  «>\,  a,A  [, 

A  —  «A.*iP]ia|PilS]pi|: 

diagram  with  a  maximum  of  the  melting  points: 

$4  —  a.  A,  «iPj.  «iPi  I  Pi  —  aiP»>  i  Pi — <*iP».«A!Pj  — 

<*jpi.  «*iA,  a,  A,  a,  A I A — «A.  aA.  a*?,.  *  A  i: 

diagram  with  a  minimum  of  the  melting  points: 

P,  —  at\,  HA,  a,P,. !  Pt  —  a  A.  *A  i  Pi  —  *A.  |. 

P> — aA.  *&.  «.A,  a*  A,  a, A  J  A  —  aA.  »#3,« 

In  practice  for  the  initial  input  of  the  physico-chemical  graph 
for  further  processing  on  a  computer  the  symbols  of  the  vertices  of 
type  o  and  B  usually  are  numbers  obtained  by  arbitrary  numbering  of  the 
lines  of  the  original  diagram. 

With  further  processing  (canonization)  of  the  diagram  these  num¬ 
bers  are  replaced  using  a  special  program  by  symbols  denoting  both  the 
phase  composition  of  the  fields  and  the  chemism  of  the  phase  transi¬ 
tions  (lines)  in  the  system.  Here  we  must  point  out  that  when  coding 
the  search  strategy  all  questions  associated  with  the  search  for  gener¬ 
al  or  concrete  topological  structures  must  also  be  represented  1  the 
form  of  a  description  of  some  physico-chemical  graph.  For  example,  such 

a  graph  for  the  solid  solutions  will  be  o-B.o.,  where  B.  j.s  a  nonnodal 

1  3  "  0 

vertex.  Then  the  finding  of  the  answer  to  such  a  question  will  be  the 
search  using  a  special  program  for  the  embeddings  of  our  subgraph 
(question)  in  all  the  graphs  described  in  the  machine  memory. 

The  analytic  description  of  the  diagram  is  a  representation  of  its 
metrics.  In  a  narrow,  concrete  sense  the  metrics  of  a  chemical  diagram 
is  the  relationship  between  the  magnitude  of  a  property  and  the  compo- 
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sition  of  the  system.  This  relationship  is  reflected  either  graphically 
on  a  diagram  constructed  from  a  table  of  experimental  data  or  analyti¬ 
cally  with  the  aid  of  some  equation.  Numerous  attempts  have  been  made 
to  find  theoretical  (based  only  on  the  properties  of  the  original  com¬ 
ponents  and  obtained  by  Interaction  of  the  components  of  the  compounds) 
relationships  for  the  lines  of  the  composition-property  diagram  which 
lead  to  equations  describing  qualitatively  the  Interactions  In  the  sys¬ 
tem,  but  which  do  not  offer  the  possibility  of  performing  quantitative 
calculations.  On  the  other  hand,  even  Mendeleyev  in  his  studies  on  so¬ 
lutions  solved  successfully  the  problem  of  approximating  experimentally 
the  resulting  curve  on  the  composition-property  diagrams  by  equations 
of  parabolas.  Among  the  recent  studies  in  this  field,  of  interest  are 
the  investigations  of  Reshetnikov,  who  has  very  successfully  approxi¬ 
mated  the  isotherms  of  the  binary  systems  -by  expressions  which  in  the 
final  analysis  are  reducible  to  an  equation  of  the  form 

miX*+C;X- f  fl, 

where  y  is  the  magnitude  of  some  system  property;  x  is  the  content  of 
one  of  the  components,  %\  a.,  a l  are  coefficients  which  are  determined 
empirically  by  the  method  of  least  squares  [81. 

Therefore  we  have  used  as  the  basis  for  the  analytic  description 
of  the  chemical  diagrams  of  binary  systems  the  representation  of  the 
chemical  diagram  as  a  combination  of  lines  expressed  in  analytical  form 
[9-11] .  In  this  case  each  line  of  the  diagram  reflects  the  functional 
relationship  y  »  f(x),  where  y  is  the  value  of  a  property  of  the  binary 
system  of  substances  of  a  given  composition,  x  is  the  percentage  of  one 
of  the  components.  As  we  have  already  mentioned  above,  so  far  there  are 
no  theoretically  derived  relationships  for  all  the  cases  of  the  proper¬ 
ty  lines.  Therefore  there  remains  only  one  approach  -  the  conversion 
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of  tabular  data  or  data  taken  from  a  curve  into  some  empirical  formula. 
This  is  essentially  the  replacement  over  a  limited  interval  of  the  com¬ 
plex  functional  dependence  of  the  system  properties  on  the  properties 
of  the  original  components  by  a  relatively  simple  empirical  formula. 

First  we  tried  she  Lagrange  interpolation  polynomials.  In  spite  of 
the  relative  simplicity  of  the  interpolation  process  itself,  the  memory 
volume  occupied  by  the  data  for  interpolation  is  quite  large  and  the 
analytical  form  of  the  diagram  constructed  on  the  base  of  the  interpo¬ 
lation  formulas  i3  extremely  complex.  However  the  primary  shortcoming 
of  the  interpolation  formulas  in  application  to  the  chemical  diagrams 
is  their  low  accuracy. 

Thus,  the  Lagrange  interpolation  polynomial 
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for  one  of  the  property  lines  of  the  state  diagram  cf  the  Bi-Te  system 
at  individual  points  gives  i  deviation  up  to  20-25%  between  the  calcu¬ 
lated  and  experimental  values. 

Incomparably  better  results  are  given  by  approximating  the  proper¬ 
ty  line  by  some  polynomial  Uoing  the  method  of  least  squares.  In  se¬ 
lecting  the  approximation  method  and  the  form  of  the  approximating  pol¬ 
ynomial  we  must  take  into  account  two  to  some  degree  contradictory  fac¬ 
tors:  the  simplicity  of  the  process  and  the  accuracy  of  the  results 
obtained.  On  the  basis  of  universality  considerations  we  chose  a  con¬ 
ventional  power  polynomial  cf  the  form: 

g  ■»  +  a.-**"-'  +  •  ■  •  +  +  co¬ 

in  the  approximation  using  this  polynomial  on  a  digital  computer 
we  took  into  consideration  the  following  factors. 

1.  The  degree  of  the  approximating  polynomial  was  not  established 
ahead  of  time,  but  was  worked  out  automatically  each  time  on  the  basis 
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of  a  given  maximal  value  of  the  deviation.  For  the  state  diagram,  fol-  f 

lowing  Gromakov  [12],  we  took 

,  Tmu 

where  T  is  the  maximal  value  of  the  temperature  on  the  given  curve. 
max 

2.  All  points,  including  those  relating  to  the  pure  components, 
are  of  equal  value.  The  arguments  that  on  the  state  diagrams  the  points 
corresponding  to  the  temperatures  of  the  phase  transitions  of  the  pure 
components  are  absolutely  reliable  quantities  while  all  the  remaining 
points  are  considerably  less  reliable  seems  to  us  to  be  in  error.  Nu¬ 
merous  facts  of  revision  of  individual  constants  of  the  individual  sub¬ 
stances  are  the  best  proof  of  this.  In  any  case,  there  are  no  bases  to 
consider  that  in  a  given  series  of  experiments  the  parameters  of  the 
substances  involved  are  identical  to  the  parameters  of  a  specially  pur¬ 
ified  and  precision  measured  compound. 

3.  The  approximation  Is  performed  using  the  method  of  least 
squares  with  solution  of  the  required  systems  of  linear  equations  by 
the  Gauss  method. 

The  first  experiments  showed  that  with  this  method  of  approxima¬ 
tion  it  is  easy  to  achieve  the  required  accuracy  and  the  resulting  pol- 
ynomiuls  rarely  have  a  degree  greater  than  ^-5.  For  the  usual  smooth 
curves  without  essential  extrema  (nearly  all  the  lines  on  the  diagrams 

I 

are  of  this  type)  normally  the  3rd  degree  is  adequate.  Thus,  for  the 
analytic  description  of  the  diagram  there  are  entered  into  the  machine 
memory  for  each  line  the  coefficients  of  the  polynomial  in  decreasing 
powers  of  the  variable  and  the  abscissas  of  the  er.d  points  of  the 
curve,  i.e.. 

The  system  described  above  for  representing  the  metrics  of  the  dl- 

i 
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agram  requires  having  during  implementation  search  algorithms  in  the 
machine  memory  and  some  program  for  calculating  the  polynomial.  We  took 
for  this  purpose  the  simplest  Horner  scheme  which  may  be  used  to  calcu¬ 
late  the  values  from  a  given  value  of  x.  However,  the  Horner  scheme  is 
not  suitable  for  such  search  problems  as  the  determination  of  the  prop¬ 
erty-composition  from  a  given  quantity.  Therefore,  it  was  necessary  ei¬ 
ther  to  supplement  the  analytical  description  by  programs  for  the  ap¬ 
proximate  solution  of  high-order  equations  or  to  expand  the  analytic 
representation  itself,  introducing  into  it  polynomials  which  are  in¬ 
verted  relative  to  the  variables.  We  investigated  both  of  these  ap¬ 
proaches.  Supplementing  the  analytic  expression  itself  required  a  doub¬ 
le  approximation,  i.e.,  for  each  property  line  we  introduced  into  the 
memory  two  polynomials: 

y =/(*);* 

This  doubles  the  time  for  the  initial  machine  processing  of  the 
diagram  and  doubles  the  length  of  the  analytic  expression.  However,  the 
search  using  this  expression  is  accomplished  with  the  aid  of  the  same 
Horner  scheme  and  requires  little  machine  time.  The  use  of  the  method 
of  approximate  solution  of  high-order  equations  does  not  require  in¬ 
crease  of  the  time  for  the  initial  processing  of  the  diagram  and  does 
not  increase  the  memory  volume  used  for  the  analytic  representation  of 
the  diagram.  However,  during  the  search  it  is  necessary  to  perform  for 
each  diagram  relatively  lengthy  (12-17  sec  on  the  Ural-*!  computer)  com¬ 
putational  operations,  which  for  an  array  of  1000  diagrams  requires  an 
additional  20-25  min  of  time.  It  appears  that  the  first  approach,  i.e., 
the  use  of  the  Inverted  polynomials,  is  mere  efficient  in  spite  of  its 
complexity.  This  method  has  been  used  in  the  search  program  described 
below. 

The  general  form  of  the  diagram  gives  information  concerning  the 
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chemical  composition  of  the  components  and  phases  on  the  diagram.  Using 

as  an  example  the  binary  metallic  systems,  an  attempt  has  been  made 

[13]  to  give  for  each  line  and  each  field  of  the  diagram  a  standardized 

formal  notation.  Thus,  for  example,  all  the  verticals  of  the  congruent- 

ly  melting  compounds  formed  by  the  interaction  of  the  original  compo- 

II 

nents  are  denoted  by  A  B  ,  the  lines  of  the  eutectic  transitions  are 
denoted  by  ABgt  the  eutectoid  transitions  by  ABg^t  the  synthetic  tran¬ 
sitions  by  ABgt  the  liquidus  lines  by  Lj a,  the  solidus  lines  by  oa, 
etc.  In  the  analysis  of  a  large  number  of  published  diagrams  there  were 
found  finite  sets  of  lines  which  describe  (bound)  the  same  regions.  The 
processing  of  these  data  made  it  possible  to  compile  tables. of  the  so- 
called  regional  logic  polynomials.  Thus,  the  region  L  (liquid)  in  the 
general  form  will  be  expressed  by  the  following  regional  logic  polyno¬ 
mial: 

[AmB\  v^Caii  AM*.®"  V  A*?’ 

Thus,  it  is  possible,  at  least  for  the  binary  metallic  systems,  to 
determine  from  the  form  of  the  logical  polynomial  the  nature  of  the  re¬ 
gion  and,  consequently,  there  is  no  need  for  a  special  introduction  in¬ 
to  the  michine  memory  of  information  concerning  the  regions  on  the 
state  diagrams.  The  nature  of  the  general  representation  may  be  seen  in 
greater  detail  from  the  description  of  the  experimental  automatic  ' 
search  system  using  the  state  diagrams.  The  system  was  designed  for  the 
Ural-1!  computer. 

The  first  version  of  the  automatic  search  system  using  the  state 
diagrams  does  not  taka  into  account  all  the  forms  of  the  representa¬ 
tions  developed  for  ihe  diagram  chemlsm.  In  particular,  it  does  not  re¬ 
flect  the  standardised  formal  notations  of  the  lines  which  was  men¬ 
tioned  above.  Therefore  the  Information  concerning  the  regions  and  the 
singular  points  of  the  diagram  are  entered  ahead  of  time,  and  are  not 
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worked  out  in  the  process  of  the  initial  processing  of  the  diagram. 

For  the  initial  -o', easing  all  the  information  concerning  the  dia¬ 
gram  is  divided  into  three  subarrays  ,  OST^.,  Sh^,  where  is  the 
subarray  containing  information  on  the  diagram  metrics  and  th:'  chemism 
of  the  diagram  regions;  OST.  is  the  subarray  containing  a  list  of  the 
singular  points  of  a  given  specific  diagram  using  the  previously  fixed 
list  of  standard  singular  points;  Sh^  is  the  "cap,"  i.e.,  the  general 
form  of  the  diagram. 

For  the  entire  collection  of  diagrams,  consequently,  there  are 
three  individual  arrays. 

1.  The  array  of  diagrams 

Alt «  NJ\i,  NJlt . . . , 

where  N.  is  the  recording  depth  of  a  single  diagram  (number  of  cells). 
The  initial  form  of  D.  is  constructed  in  turn  as  follows : 
diagram  number , 

number  of  the  let  curve  (using  an  arbitrary  enumeration  of  the 
curves  within  each  diagram) , 

the  coordinates  of  the  points  of  this  curve 

,,c 

the  punctuation  mark  Aj  (end  of  curve  1), 
number  of  the  2nd  curve  and  so  on  for  all  the  curves, 
punctuation  sign  A*  (end  of  coordinate  recording  of  all  the 
-urves ) , 

region  number 

number  of  the  uppermost  curve  in  the  region, 
number  of  the  lowest  curve  in  the  region, 

punctuation  sign  A s  (end  of  region),  and  so  on,  for  all  the  re¬ 
gions  , 

punctuation  iiign  A4  (end  cf  region  array),  region  dictionary;  the 


. .  .  -  -  ,  ,  ,  .  ,  . . . ' 

region  number  Is  the  designation  of  the  region, 

punctuation  symbol  A (end  of  region  dictionary), 
punctuation  symbol  A ^  (end  of  diagram). 

II.  The  array  of  singular  points  Is 

M,  _  A'jOCTt/A'tOCT,,  ... 

The  Initial  form  of  the  OST.  is  constructed  in  turn  as  follows: 

t 

diagram  number, 
singular  point  code, 

its  coordinates  (x.y. ),  and  so  on,  for  all  the  singula>  points  of 
of  the  diagram. 

III.  The  cap  array  is 

... 

Each  cap  Includes 
diagram  number, 
system  components, 

code  of  the  property  described  by  the  diagram, 

code  for  the  dimensions  of  the  composition  expression, 

bibliographic  citation  code. 

The  order  of  arrangement  of  the  diagrams  in  these  three  separate 
array;  may  be  arbitrary  and  they  may  not  correspond  to  one  another.  The 
identification  is  accomplished  using  the  diagram  number. 

The  initial  processing  of  the  diagram  array  is  accomplished  simul¬ 
taneously  for  the  entire  collection  and  has  as  its  objective  the  reduc¬ 
tion  of  the  initial  representations  to  a  form  which  in  convenient  for 
rapid  and  complete  search.  In  particular,  from  the  array  Mj  there  is 
developed  a  step  (here  and  in  the  following  in  this  section  a  step  is 
the  number  of  cells  assigned)  for  each  diagram,  a  step  for  the  curve 
subarray, .for  the  region  aubarray,  etc.,  and  there  are  also  worked  out 
by  approximating  the  tabular  data  the  polynomials  y  •  f(x)  and  x  •  V(y) 
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for  each  diagram  curve  using  an  a  priori  specified  magnitude  of  the  ac¬ 
ceptable  deviation.  In  this  case  the  polynomials  are  described  by  the 
coefficients  of  the  variable  with  an  Indication  of  the  degree  of  the 
polynomial. 

The  entire  array  If  reorganized  so  that  It  will  be  easy  to  Identi¬ 
fy  all  the  diagrams  on  the  three  basic  arrays.  After  this  Initial  proc¬ 
essing  we  obtain  three  new  arrays  m m ,, . 

The  array  Is  obtained  by  processing  the  M ^  array.  The  essence 
of  the  processing  was  described  above.  The  array  Is  the  same  singu¬ 
lar  point  array  N2»  while  the  cap  Array  m3  becomes  dominant  and  Is  ar¬ 
ranged  as  follows: 

**  -  HIiC0i>  <OCT,>.  !!!,<&>  <OCT*> .... 

where  <D^>  Is  the  address  of  the  diagram  In  the  array  n^;  <OCT^>  Is 
the  address  of  the  subarray  of  singular  points  of  the  diagram  in  the 
array  mg. 

The  search  program  solves  seven  basic  search  problems. 

Problem  n ^  -  for  a  given  concrete  diagram  determine  from  the  composi¬ 
tion  the  magnitude  of  a  property  (from  x  find  all  values  of  y). 

Problem  -  for  a  given  concrete  state  diagram  or  for  all  state  dia¬ 
grams  with  some  one  common  component  generate  the  value  of  a  property 
on  the  liquidus  line  from  the  given  composition  (from  x  find  m). 
Problem  -  for  a  given  diagram  using  a  specified  magnitude  of  a  prop¬ 
erty  determine  the  system  composition  (one  value  of  the  composition). 
Problem  n4  -  for  a  given  diagram  or  for  all  diagrams  in  an  array,  using 
the  value  of  a  property  determine  all  the  system  compositions  corre¬ 
sponding  to  this  prr  ... 

Problem  n 4  -  from  tin.  w„< .  coordinates  of  a  describing  point  find  the 
diagram  region  In  which  It  la  located. 

Problem  »»#  -  determine  all  the  singular  points  of  the  diagram. 
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Problem  n7  -  output  all  information  concerning  the  diagram. 

It  has  been  shewn  that  the  majority  of  problems  reduce  either  to 
these  seven  types  of  basic  problems  or  to  their  combinations.  The  ques¬ 
tion  (inquiry)  prior  to  input  into  the  computer  must  have  the  form 

2  ’  "inquiry ‘"problem- 4  (4,t*  *t*i‘  ani  00  on>- 

The  search  program  permits  simultaneous  introduction  of  5-7  in¬ 
quiries  (l.e.,  no  more  than  20  values  of  and  10  values  of  y^).  The 
search  Itself  consists  of  two  major  portions:  the  generation  of  the 
control  sequence  and  the  extraction  of  the  information.  The  control  se¬ 
quence  determines  in  what  order  we  must  refer  to  the  arrays  m ^  and  m2, 
and  only  after  this  is  the  information  concerning  the  diagrams  extract¬ 
ed  and  the  search  problems  solved.  During  the  search  the  array  m ^  is 
transferred  by  parts  in  order  to  generate  the  control  sequence  from 
magnetic  tapes  onto  the  computer  magnetic  drum,  while  the  arrays  m ^  and 
m2  remain  on  the  magnetic  tapes  throughout  the  search.  The  problem  so¬ 
lution  time  does  not  exceed  30-40  sec  for  an  array  with  400  diagrams. 

On  the  average  the  time  for  the  initial  processing  of  each  diagram 
is  about  3  min. 

We  see  from  the  system  description  that  its  use  still  does  not 
permit  the  solution  of  problems  associated  with  finding  the  diagrams  of 
conanon  topological  structure.  To  do  this,  in  the  new  system  thert  must 
be  introduced  a  representation  of  the  "chemism"  of  the  lines,  i.e.,  a 
representation  of  the  nature  of  the  physico-chemical  transformations  in 
the  system  as  the  describing  point  moves  over  the  diagram.  In  addition, 
the  entire  diagram  must  be  described  as  a  physico-chemical  graph  using 
the  method  of  the  star  table.  Such  a  system  will  permit  solving  any  in¬ 
formation  search  problems. 

The  collection  of  composition-property  diagrams  stored  in  the  com¬ 
puter  memory  permits  the  solution  also  of  certain  correlation  and  corr- 
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putatlon  problems  of  physico-chemical  analysis.  In  this  case  the  com¬ 
puting  capabilities  of  the  machine  Itself  are  used  In  full  measure.  An 
example  of  the  correlation  problems  la  the  statistical  verification 
[14]  of  the  Kordes  equation  [15]: 

«  _  r,-r.  r. 

tt — ir~*z=rr  • 

where  Ta,  T ^  and  Tg  are  respectively  the  molecular  (atomic) 

concentrations  and  crystallization  temperatures  of  the  components  and 
the  eutectic.  Using  as  an  example  20  metallic  and  34  salt  systems  with 
a  eutectic  (without  secondary  complications)  on  a  digital  computer  us¬ 
ing  the  data  stored  In  the  computer  memory ,  It  was  shown  that  the  Kor¬ 
des  equation  Is  not  precise,  and  may  be  used  only  as  the  basis  for  a 
very  rough  statistical  prognosis.  In  this  case,  after  finding  the  sec¬ 
tions  of  the  surface  described  by  the  Kordes  equation,  we  found  the  di¬ 
rect  regression  equations  for  the  salt  systems: 

and  for  the  metallic  systems: 

It  Is  Interesting  that  the  correlation  relations  derived  after  20 
min  are  close  In  thel^  form  to  the  correlation  equations  for  the  liquid 
azeotropic  systems  [16],  obtained  by  the  authors  after  several  months 
of  comparisons  and  calculations.  We  see  from  this  example  the  vast  cap¬ 
abilities  for  studying  and  logical  processing  of  the  material  of  physl- 
cc-cnemical  analysis  offered  by  the  system  of  machine  recording  and 
machine  processing  of  the  composition-property  diagrams. 

III.  AUTOMATIC  NOMENCLATURE  TRANSLATION 

One  of  the  first  situations  whe. e  the  chemical  information-logic 
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methods  are  applicable  in  processing  various  chemical  Information  la 
the  automatic  conversion  of  various  sorts  of  chemical  compound  names 
Into  formulas  In  order  to  make  it  possible  to  store  them  uniquely  In 
the  computer  memory.  When  outputting  from  the  computer  the  results  of 
the  operation  of  the  search  algorithms  frequently  the  resulting  com¬ 
pound  must  be  presented  In  the  fora  of  a  name.  In  this  case  we  need 
a  unique  Inverse  automatic  translation)  i.e.,  the  translation  of  an 
equation  Into  a  name  which  Is  given  In  some  systematic  nomenclature 
system.  In  addition  to  these  utilitarian  problems,  a  similar  machine 
translation  of  names  may  be  used  advantageously  for  compiling  formula 
indices  for  abstract  Journals,  handbooks,  etc. 

The  possibility  of  using  the  modern  electronic  computers  for  con¬ 
verting  the  names  of  chemical  compounds  into  their  formulas  is  based  on 
representing  the  name,  on  the  one  hand,  and  the  formula  of  the  chemical 
compound;  on  the  other  hand,  as  mutually  unique  calculable  functions. 
With  this  approach  the  machine  conversion  of  a  name  into  a  formula  (or 
a  formula  into  a  name)  is  considered  as  a  computational  process  per¬ 
formed  usirg  a  definite  program.  However,  the  compound  name  is  not 
mathematical  information,  therefore  its  machine  conversion  is  a  problem 
of  a  logical  nature  having  much  in  common  with  the  problems  of  automat¬ 
ic  translation  from  one  language  into  another. 

The  variety  of  the  nomenclature  systems  in  chemistry  and  primarily 
the  ambiguity  of  the  names  in  each  system  make  the  problem  of  automatic 
nomenclature  translation  extremely  complex.  This  applies  to  the  names 
of  both  the  organic  and,  particularly,  the  inorganic  compounds.  The 
first  attempts  at  automatic  nomenclature  translation  in  organic  chemis¬ 
try,  in  spite  of  the  existence  in  this  field  of  more  or  less  universal 
nomenclature  systems,  because  of  the  complexity  of  the  translation 
rules,  have  been  Halted  so  far  to  individual  classes  of  compounds  or 


types  of  names.  In  inorganic  chemistry  the  experimental  translations  of 
names  into  formulas  and  vice  versa  have  been  conducted  only  by  the  au¬ 
thor  of  the  present  paper  with  his  co-workers  [17-20].  In  the  following 
we  shall  examine  some  results  of  these  experiments  and  the  chemical 
bases  for  them. 

First  let  us  agree  to  call  the  translation  of  names  into  formulas 
the  direct  automatic  nomenclature  translation,  and  the  translation  of 
formulas  into  names  will  be  termed  inverse  translation.  In  the  field  of 
direct  nomenclature  translation  the  names  of  the  complex  compounds  were 
the  first  to  be  studied.  The  selection  of  the  complex  compounds  for 
creating  the  first  program  for  machine  translation  of  names  into  formu¬ 
las  is  based  on  the  fact  that  the  complex  compounds  may  be  named  using 
several  rational  nomenclatures  while  the  systematic  names  themselves 
are  quite  complex  both  with  respect  to  principle  of  construction  and 
with  respect  to  structure.  The  trivial  names  of  the  complex  compounds 
such  as  luteo  salt,  Peron  salt,  let  Reyze  base,  which  are  widely  used 
and  previously  were  in  competition  with  the  systematic  names,  do  not 
require  logical  processing  for  machine  decoding  and  are  easily  trans¬ 
lated  by  simply  finding  in  a  dictionary  the  corresponding  formula 
equivalent.  However,  such  a  dictionary  becomes  extremely  bulky  for  the 
ever  increasing  number  of  trivial  names  in  the  literature  and  possibly, 
has  already  reached  the  saturation  limit.  Therefore  as  the  initial  in¬ 
formation  we  selected  the  rational  names  using  any  of  the  accepted  and 
suggested  nomenclatures:  Werner’s  [21]  with  supplements  of  Chernyayev 
[23]  and  Greenberg  [22],  the  international  IUPAC  [24],  the  nomenclature 
suggested  by  the  8 th  Mendeleyev  Congress  of  the  Commission  of  the  OKhN 
of  the  Academy  of  Sciences  USSR  [25]. 

First,  the  rules  for  construction  and  the  structure  of  the  names 
of  the  complex  compounds  in  all  the  listed  nomenclatures  were  subjected 
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to  formal  logic  analysis.  In  this  process  we  identified  typical  roots 
and  the  so-called  search  operators  which  relate  the  roots  in  a  single 
name.  The  identified  roots  and  the  search  operators  were  combined  into 
a  "root  dictionary." 

The  program  consists  of  four  basic  blocks:  identification  of  the 
roots,  analysis,  combinatorics,  formula  synthesis,  and  formula  print¬ 
out.  In  the  root  identification  block  the  term  is  broken  down  into 
chemically  meaningful  roots.  This  division  is  accomplished  with  the 
aid  of  the  root  dictionary  stored  in  the  computer  memory.  Table  2  shows 
as  an  example  certain  sections  of  this  dictionary. 

The  entire  translation  process  from  name  input  to  formula  printout 
is  accomplished  automatically  and  consists  of  3500-6000  elementary  log¬ 
ic  and  arithmetic  operations.  The  difference  in  the  number  of  elementa¬ 
ry  operations  required  for  the  translation  of  various  names  depends  on 
the  name  length  and  the  frequency  of  encounter  in  the  name  of  letters 
which  occupy  the  most  "populated"  portions  of  the  dictionary.  The  aver¬ 
age  machine  time  for  translating  one  name  is  3-5  sec. 

Following  are  several  specimens  of  such  a  machine  translation. 

Input: 

potassium  dicyanoargentate,  or 
dicyano-(l+)  potassium  argentate. 

Output: 

00212  00001  37777  00201  00001  00205  00002  37777  00001, 
i.e.,  the  formula  K1[Ag1(CN)2]1. 

Input: 

hexachloroplat  acid,  or 
hexachlorc-(lJ+)  platinate  acid. 

Output: 

00215  00002  37777  00202  00001  00206  00006  37777  0000 1, 
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TABLE  2 


Gaof 

1 

P««r 

2 

Ckmim 

J _ 

Kon6kh«- 

top 

4  ' 

Kn^  CMMPU14  HpN 
MflNCN  +<»pMy/IM 

■  MM«Tb  KaiklNMM 

5 

<  OCNOBH 

7  ••prtHT- 

A 

M 

00201 

8  •WIST- 

A 

Pt 

00202 

(•niuM- 

A 

Co 

00204 

1  0  .UMNO- 

H 

(CNj 

ftWII 

002.1;, 

1  1  -MOpO- 

b 

Cl 

nmmi 

00200 

1  2 

fi 

(Nil,) 

(imi 

out  t 

1  S  "(MM* 

c 

K 

mm 

00212 

1  ‘♦-KMCVIOT* 

c 

H 

(mil 

110215 

1  S-cy;u$»T 

D 

(SO,) 

im>2 

00217 

1  (noKCKOuie  onepiTopw 

17  . 

a 

00002 

1  t  -Terpi- 

a 

00004 

1  9  -WTp- 

2  0  -r*KC»- 

a 

00006 

2  l  -Kiin- 

'a 

00005 

-a- 

p 

00001 

o 

•t- 

p 

00004 

2  2 

P 

- 

000O3 

(3+) 

2  3  ■  T.  A. 

Remarks .  Rank:  A)  complexing  agent,  B)  ligand,  C)  outside-sphere  cat- 
ion,  D)  out side -sphere  anion.  The  combiner  is  a  number  whose  last  digit 
is  equal  to  the  valence. 

1)  Root;  2)  rank;  3)  symbol;  4)  combiner;  5)  symbol  code  when  writing 
formula  in  machine  memory;  6)  bases;  7)  argent;  8)  plat;  9)  cobalt;  10) 
cyano;  11)  chloro;  12)  amin;  13)  potassium;  14)  acid;  15)  sulfate;  16) 
search  operators;  17)  di;  18)  tetra;  19)  tetr;  20)  hexa;  21)  penta;  22) 
i;  23)  et  cetera. 


i.e.,  the  formula  H2[Pt1Clg]1. 

Input: 

pentaaminchlorocobalt  sulfate,  or 
sulfate  of  pentamine-chloro-cobalt  (3+). 

Output: 

37777  ($0204  00001  00211  00005  00206  00001  37777  00001 
00217  00001, 

i.e.,  the  formula  [Co^NH^Cl^CSO^. 

Here  the  five-digit  numbers  of  the  type  00212,  00206  (with  two 
initial  zeroes)  are  the  symbols  of  the  elements  or  radicals;  their  val¬ 
ues  are  given  in  Table  2.  Numbers  of  the  type  00001,  00006  (with  four 
initial  zeroes),  given  in  the  name  itself  or  generated  by  means  of  com- 
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binatorics,  are  the  subscripts  of  the  symbols  and  brackets.  The  number 
37777  is  the  left-hand  and  right-hand  square  brackets.  The  index  1, 
usually  not  written  in  the  formulas,  is  retained  for  convenience  of 
various  sorts  of  logic  operations  with  the  formulas  which  are  required 
for  solving  various  information  problems. 

Par  more  complex  and  considerably  less  defined  is  the  translation 
-  of  the  names  of  the  conventional  (noncomplex)  inorganic  compounds,  for 

which  as  yet  there  is  no  rational  nomenclature.  The  IUPAC  nomenclature 
and  the  draft  of  the  Nomenclature  Commission  of  the  Division  of  Chemi¬ 
cal  Sciences  of  the  Academy  of  Sciences  USSR  may  to  some  degree  facil¬ 
itate  the  translation,  although  the  introduction  of  a  new  nomenclature 
is  a  lengthy  process  and  the  old  names  will  continue  to  be  encountered 
in  the  literature  for  a  long  time  to  come.  But  even  with  the  availabil¬ 
ity  of  a  unified  nomenclature  we  must  first  of  all  face  the  fact  that 
the  translation  process  must  be  as  formal  as  possible  with  minimal  re¬ 
liance  on  additional  (other  than  the  name  itself)  information.  The  no¬ 
menclature  systems  mentioned  above,  unfortunately,  have  several  alter¬ 
natives  and  admit  differences  of  reading.  Therefore,  in  the  initial 
stage  the  problem  of  direct  nomenclature  translation  of  the  names  of 
the  inorganic  compounds  was  intentionally  limited  to  the  simple  and 
most  frequently  encountered  compounds.  However,  even  in  this  case  we 
encountered  several  difficulties  associated  with  the  indeterminacy  of 
the  names  of  even  the  simple  compounds.  The  algorithm  is  described  in 
detail  in  [18,  20] .  Here  we  shall  only  mention  that  for  the  translation 
we  must  have  in  the  computer  memory  the  following: 

1)  the  root  dictionary, 

2)  the  search  operator  table, 

3)  the  control  operator  table, 

4)  a  summary  of  the  rules  for  transforming  the  names  of  the  acids, 
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bases,  and  compounds  of  the  thio-,  oxy-,  and  halogenides  and  the  chal- 
cogenides . 

In  the  root  dictionary  (containing  more  than  3000  entries)  there 
are  collected  all  the  roots  encountered  in  the  names  with  their  formula 
equivalents  and  additional  information  on  the  possible  valency  of  the 
atoms  or  radicals  in  different  situations  (as  anions,  cations,  in  homo- 
polar  compounds,  oxides,  etc.)*  The  search  operators  are  prefixes  of 
the  type  mono-,  homo-,  tetra-,  hepta-,  monovalent-,  etc. 

In  the  table  of  control  operators  there  are  collected  operators  of 
the  type:  basic-,  acid-,  mono-substituted-,  secondary-,  etc. 

The  translation  process  itself  consists  of  breaking  the  name  into 
roots  with  the  aid  of  the  root  dictionary  and  tables,  replacing  the 
roots  using  definite  rules  by  their  equivalents,  combinatorics  (selec¬ 
tion  of  subscripts  for  the  symbols  of  the  atoms  or  radicals),  and  "ed¬ 
iting”  the  resulting  formula  (dropping,  where  necessary,  brackets,  re¬ 
arranging  symbols,  etc.). 

Even  with  the  extensive  additional  information  obtained  after  a 
detailed  logical  analysis  of  the  Russian  chemical  literature,  the  ex¬ 
perimental  testing  of  the  algorithm  did  not  always  yield  a  unique  re¬ 
sult.  However,  the  indeterminacy  and  ambiguity  is  not  a  defect  of  the 
algorithm,  but  a  result  of  the  indeterminacy  of  the  names  themselves. 
Just  as  the  chemist  when  he  encounters  the  term  "ruthenium  chloride," 
without  a  logical  analysis  of  the  entire  context  he  cannot  way  whether 
the  discussion  concerns  RuCl^  or  perhaps  RuCl^,  or  even  some  other  com¬ 
pound,  similarly  the  program  stored  in  the  machine  does  not  lead  to  a 
single  formula  In  this  case.  This  applies  equally  to  the  oxides,  thlo- 
halogenldes,  etc. 

Normally  for  the  purposes  of  storing  the  chemical  compound  in  the 


144  - 


digital  computer  memory  the  direct  automatic  nomenclature  translation 
is  used.  The  problem  of  Inverse  nomenclature  translation  was  formulated 
in  connection  with  the  necessity  for  output  from  the  computer  of  the 
results  of  automatic  search  among  the  inorganic  compounds  VJbe  result 
of  one  of  the  forms  of  search  [2]  is  a  list  of  compoundr  which  satisfy 
an  entire  set  of  specified  questions.  Since  in  the  computer  memory 
there  is  stored  a  list  of  formulas  of  the  inorganic  compounds,  we  must 
output  from  the  computer  either  these  same  formulas  or,  if  this  is  not 
possible,  the  names.  We  can  always  present  the  x’ormula  in  coded  form, 
i.e.,  replacing  the  symbols  of  the  chemical  elements  by  some  digital 
code  notations.  However,  this  is  not  always  convenient.  There  is  no 
Latin  alphabet  on  the  Soviet  computers  with  alphanumeric  printers  (for 
example,  the  Ural-4  computer).  Therefore,  it  was  desirable  to  translate 
the  formula  automatically  into  the  systematic  name,  which  can  be  print¬ 
ed  out  by  letters  in  the  Russian  alphabet. 

The  problem  of  inverse  automatic  nomenclature  translation  of  the 
inorganic  compounds  is  less  associated  with  such  a  narrow,  utilitarian 
objective.  It  is  of  interest  primarily  because  it  yields  the  possibili¬ 
ty  of  studying  from  the  point  of  view  of  formal  and  semantic  uniqueness 
any  nomenclature  systems.  Of  no  little  importance  in  such  a  study  is 
the  question  of  minimizing  the  number  of  logic  operations  required.  The 
fewer  the  logic  operations  in  the  translation  from  the  formula  to  the 
name,  the  more  efficient,  the  more  rational  the  nomenclature.  In  the 
formal  analysis  of  the  nomenclature  systems  we  must  also  pay  attention 
to  their  universality,  i.e.,  to  the  ability  to  encompass  any  inorganic 
compounds . 

As  a  result  of  a  study  using  the  Inverse  automatic  nomenclature 
translation  of  the  XUPAC  and  Commission  on  Nomenclature  of  the  OKhN  of 
the  Academy  of  Sciences  USSR  nomenclatures  it  was  found  that  they  are 


not  universal,  since  they  encompass  only  the  primary  types  of  chemical 
compounds.  Moreover,  at  times  the  translation  requires  serious  reanaly¬ 
sis,  which  Is  associated  with  a  considerable  complication  of  the  pro¬ 
grams.  Thus,  for  the  names  of  the  heteropoly  anions  and  the  heteropoly 
cations  we  must  seek  the  central  atom,  calculate  how  many  acid  groups 
remain  In  the  compound,  etc.  However,  very  frequently,  as,  for  example. 
In  the  cases  K2Mg2V10O2g *16H20  or  even  KgMnMOgO^  we  are  not  able  to 
determine  the  formal  characteristics  of  the  hetero-  and  poly  anions, 
and  the  computer  is  forced  to  give  two  or  more  types  of  names.  Attempts 
to  Improve  the  IUPAC  nomenclature  and  Its  reflection  In  the  draft  of 
the  Commission  on  Nomenclature  of  the  OKhN  have  not  been  successful. 

The  artificial  technique  of  considering  any  compound  simply  as  combina- 

* 

tions  of  ions  (anion  and  cation  parts),  used  In  the  IUPAC  nomenclature, 
in  addition  to  its  unjustified  artificiality,  complicates  the  programs 
so  much  that  it  makes  the  translation  itself  extremely  time-consuming 
and  worst  of  all  ambiguous.  The  results  of  the  formal  analysis  of  the 
nomenclatures  will  be  presented  in  a  separate  communication  in  greater 
detail.  Considering  the  inapplicability  of  the  existing  nomenclature 
systems  as  name  systems  at  the  computer  output,  we  have  used  the  Nekra¬ 
sov  system  which  is  most  universal  and  absolutely  unique,  also  as  a 
simple  set  of  translation  rules.  The  essence  of  this  system  is  that  in 
the  majority  of  cases  the  nomenclature  translation  is  transformed  into 
tf>e  operation  of  "putting  into  words"  the  formulas  without  their  logi¬ 
cal  conversion.  The  sense  of  this  puttlng-lnto-words  is  easily  seen 
from  the  examples: 

Na20  -  disodium  oxide; 

CaO  -  calcium  oxide; 

0PC1  -  oxo-phosphor-chloride; 

-  pentabor-nonahydrlde; 


SiHgFCl  -  silicon-dihydro-fluor-chlorine,  etc. 

For  the  individual  atomic  groupings  (in  our  notation  they  appear 
It  the  formula  only  in  round  brackets)  the  group  names  are  retained  and 
used.  For  example, 

Fe{N0g)2  -  iron-dinitrate; 

-  tetrasulfur-tetraimine,  etc. 

We  have  not  been  able  to  find  a  single  example  of  an  inorganic 
compound  formula  which  cannot  be  completely  uniquely  translated  into  a 
systematic  name  following  .Nekrasov.  The  program  compiled  for  the  Ural-1! 
computer  contains  without  the  auxiliary  dictionaries  about  1500  in¬ 
structions,  and  the  translation  of  a  formula  into  a  name  together  with 
printout  of  the  results  (i.e.,  the  name  itself)  requires  1-2  sec. 

IV.  INFORMATION-LOGIC  METHODS  OF  CALCULATING  COMPOUND  PROPERTIES 

In  the  first  section  of  this  paper  we  introduce  the  concept  of  the 
universal  class  V,  whose  elements  are  all  properties  of  all  chemical 
compounds  and  their  combinations,  and  also  all  conceivable  characteris¬ 
tic  conditions  for  determining  the  properties  of  each  chemical  object. 
Going  still  further  in  this  direction,  we  must  introduce  still  another 
class  -  the  set  of  governing  laws  or  relations  Vy  This  set  consists  of 
the  mathematical  equations  relating  both  the  elements  within  the  sub¬ 
sets  of  objects  Qt  parameters  (properties)  P,  characteristic  conditions 
U,  and  also  the  elements  from  the  various  subsets  of  the  universal  set 

V.  Many  of  these  relations  are  the  mathematical  expression  of  the  laws, 
relationships,  and  rules  already  available  in  chemistry,  derived  theo¬ 
retically  or  determined  experimentally.  Certain  of  these  equations, 
namely  the  Antoine  equation  and  the  power  series  for  the  specific  heat, 
were  utilized  in  the  previously  discussed  automatic  search  systems  for 
the  inorganic  compounds.  Having  available  in  the  computer  memory  an  ar- 
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ray  of  information  on  a  quite  large  number  of  chemical  compounds,  it  is 
quite  advantageous  to  supplement  it  by  certain  algorithms  which  realize 
the  information-logic  methods  of  nonsearch  operations  with  arrays. 

It  is  of  advantage  to  include  in  such  information-logic  methods: 

1)  the  methods  of  finding  a  concrete  value  of  a  property  if  the 
latter  is  contained  in  the  information  accompanying  the  compound  in  im¬ 
plicit  form  (direct  calculation; : 

2)  methods  of  finding  (predicting)  the  concrete  value  of  a  proper¬ 
ty  by  analogy  with  the  parameters  of  other  similar  compounds  or  even  of 
this  same  compound,  but  under  oth^r  characteristic  conditions  (compara¬ 
tive  calculation); 

3)  methods  of  finding  (predicting)  the  concrete  value  of  a  proper¬ 
ty  of  a  compound  from  the  presence  in  the  compound  of  definite  struc¬ 
tural  fragments  (strukturno-additivnyy  calculation); 

4)  methods  of  finding  new  functional  or  correlation  relations  in 
the  information  array  concerning  the  properties  of  the  chemical  com¬ 
pounds  (general  approach). 

We  must  immediately  admit  that  this  listing  is  hardly  all-encom¬ 
passing,  since  the  development  of  chemistry,  on  the  one  hand,  and  ex¬ 
perimental  studies  in  the  field  of  machine  operations  with  arrays  of 
chemical  information,  on  the  other  hand,  without  doubt  will  bring  to 
life  new  methods  or  will  require  essential  alteration  of  the  listed 
methods. 

Let  us  examine  in  greater  detail  all  these  methods  from  the  aspect 
of  the  basic  premises  and  procedures  for  their  realization  on  modern 
electronic  computers. 

The  direct  calculation  methods  are  applicable  in  the  general  sys¬ 
tem  to  the  degree  that  there  exist  and  are  described  in  the  literature 
various  empirical  formulas  of  a  nonadditive  nature.  There  are  many  such 
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formulas,  but  they  all  have  extremely  limited  field  of  application.  For 
our  purposes  the  formulas  of  greatest  Interest  are  those  of  the  type 
P  ■  f(U),  describing  the  variation  of  the  property  (parameter)  with 
variation  of  the  characteristic  definition  conditions.  Usually  pressure 
or  temperature  appears  as  this  characteristic  condition.  Storing  in  the 
computer  memory  only  the  empirical  coefficients  of  such  an  equation, 
for  a  given  compound  we  always  have  the  possibility  of  calculating  the 
value  of  the  property  with  variable  values  of  the  characteristic  condi¬ 
tions.  This  method,  as  we  mentioned  previously,  was  used  in  the  auto¬ 
matic  search  system  for  the  Inorganic  compounds. 

The  strukturno-addlolvnyy  calculation  is  based  on  adding  some  in¬ 
crement  (partial  fraction)  to  the  individual  fragments  of  the  structur¬ 
al  formula  or  even  to  the  fractions  of  some  measured  geometric  parame- 

2 

ter  of  an  atom  or  molecule.  The  algebraic  sum  of  these  increments  and 
the  increments  of  the  relative  positioning  of  the  structural  fragments 
yields  the  magnitude  of  the  desired  property. 

This  method  does  not  work  for  all  properties,  but  does  for  the  so- 
called  additive  properties  end  primarily  only  for  comparatively  narrow 
classes  of  chemical  compounds.  The  following  basic  equation  is  charac¬ 
teristic  for  each  strukturno-additivnyy  method  of  calculation: 

P*  *=> 

1-1 

where  P 1  is  the  magnitude  of  the  desired  property  of  the  given  com¬ 
pound;  is  the  increment  (partial  fraction)  for  the  given  property 
•  per  one  definite  structural  fragment  of  the  molecule  or  per  a  definite 

interaction  of  these  fragments;  n ^  is  the  number  of  the  given  structur¬ 
al  fragments  in  the  molecule . 

We  used  this  basic  equation  in  [ 26 j  to  design  a  machine  scheme  for 
additive  calculation  of  the  physico-chemical  properties  of  the  paraffin 
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hydrocarbons  as  a  supplement  to  the  Information  search  system.  In  those 
cases  when  In  the  information  array  stored  in  the  computer  memory  there 
is  no  value  of  some  property  of  the  alkane  (usually  higher)  specified 
by  the  inquiry ,  this  machine  calculation  scheme  is  put  into  operation. 
The  problem  was  posed  of  calculating  the  property  of  a  paraffin  hydro¬ 
carbon  using  not  only  the  formula  but  also  the  name.  Thus,  the  problem 
was  divided  into  three  stages:  construct  automatically  from  the  name, 
given  in  some  systematic  nomenclature,  the  structural  formula,  identify 
from  this  formula  definite  structural  fragments,  and,  finally,  from  the 
identified  fragments  perform  the  calculation.  The  use  of  the  formulas 
and  set  of  fragments  presented  in  [27,  28]  made  it  possible  to  create  a 
scheme  taking  into  account  more  than  10  types  of  properties  for  any 
paraffin  hydrocarbon.  For  any  paraffin  hydrocarbon,  depending  on  the 
complexity  of  its  structure,  the  automatic  machine  calculation  of  the 
properties  together  with  the  procedure  fcr  direct  nomenclature  transla¬ 
tion  and  identification  of  the  structural  fragments  consists  of  1000- 
2000  computer  operations  and  lasts  only  1-2  sec.  This  speed  makes  pos¬ 
sible,  first,  the  large-scale  calculation  of  the  constants,  and,  sec¬ 
ond,  somewhat  exceeds  the  speed  of  the  search  for  the  constants  stored 
in  the  computer  memory.  This  makes  it  possible  to  replace  the  search 
for  the  constants  by  a  calculation  in  certain  cases  which  do  not  re¬ 
quire  definiteness. 

i 

The  primary  drawback  of  this  method  of  calculation  lies  in  its  ba¬ 
sic  assumptions.  Thus,  only  in  the  first  approximation  can  we  speak  of 
the  additivity  of  many  properties  without  consideration  for  the  inte¬ 
gral  parameter  of  the  entire  molecule.  Therefore  the  strukturno-addi- 
tlvnyy  methods  are  frequently  complicated  by  the  Introduction  of  "rela¬ 
tive  positioning"  increments,  for  example,  increments  of  the  interac¬ 
tion  of  the  structural  fragments  as  a  function  of  the  type  of  lnterme- 
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diate  or  surrounding  atoms  or  bonds.  Essentially  this  complication  Is 
caused  by  the  desire  to  Introduce  Into  the  calculation  some  Integral 
factor.  The  machine  calculation  using  such  complicated  strukturno- 
addltlvnyy  schemes  has  great  possibilities  both  with  respect  to  the  set 
of  properties  which  can  be  considered  and  with  respect  to  accuracy 
[293. 

Comparative  calculation.  By  tils  term  we  usually  mean  two  ap¬ 
proaches  which  have  a  different  basis.  First,  this  Is  the  use  of  a  reg¬ 
ular  variation  of  some  property  with  other  conditions  being  the  same  in 
a  series  of  "similar,"  "analogous,”  "monotypie"  compounds,  syutems  of 
substances,  or  reactions.  Second,  this  is  the  use  of  a  regular  varia¬ 
tion  of  some  property  of  one  substance  with  a  variation  of  the  charac¬ 
teristic  conditions. 

The  six  methods  of  comparative  calculation  formulated  by  Karapet’- 
yants  [30]  are  based  on  these  two  approaches. 

The  interest  in  the  comparative  methods  of  calculation  of  the 
properties  of  compounds  for  use  in  the  information-logic  systems  is  de¬ 
termined  by  the  fact  that  they  are  essentially  a  ready-made  model  of 
comparison  by  analogy  and  operate  on  the  most  varied  classes  of  chemi¬ 
cal  compounds.  The  basic  difficulty  in  formalizing  the  comparative  cal¬ 
culation  methods  is  the  a  priori  formation  of  series  of  similar  com¬ 
pounds,  monotypie  reactions,  etc.  We  made  an  attempt  at  such  a  formal¬ 
ization  in  [31].  However,  the  logical  aimilarity  criteria  worked  uut  in 
that  study  are  obviously  Inadequate  and  in  each  particular  case  they 
must  be  supplemented  by  contensive  criteria  of  chemical  similarity. 
Therefore,  with  a  very  simple  algorithm  of  the  calculation  itself,  made 
using  formulas  of  the  general  form 

o,  -  «c,  +  a, 

where  £j  are  the  properties  or  their  elementary  functions  for  one 
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or  more  objects;  a  and  b  are  empirical  coefficients,  the  basic  diffi¬ 
culty  if  the  technique  for  selecting  the  objects  for  comparison.  At  the 
present  time  we  make  extensive  use  of  the  ''trial  and  error"  method  for 
doing  this,  leading  in  the  limiting  case  to  the  sorting  of  nearly  the 
entire  array. 

General  approach.  The  attempt  to  create  a  regular  procedure  for 
finding  the  governing  laws  in  an  array  of  chemical  information  and  the 
subsequent  calculation  is  the  so-called  general  (formal)  approach.  In 
developing  this  method  we  started  from  the  postulate  that  to  each  chem¬ 
ical  compound  there  is  placed  in  correspondence  some  point  in  an  n-di- 
mensional  space,  where  n  is  the  number  of  parameters  (properties)  which 
form  the  basic  system,  and  that  the  (n  +  l)st  property  may  be  repre¬ 
sented  in  some  closed  region  of  variation  of  the  parameters  by  a  con¬ 
tinuous  function  with  derivatives  which  are  bounded  in  absolute  magni¬ 
tude  by  the  positive  number  M  [32]. 

Expanding  this  function  (not  knowing  its  form)  in  a  Taylor  series, 
by  the  method  of  least  squares  we  can  find  the  sought  governing  law. 
Using  certain  structural  parameters  of  the  molecule  as  the  basic  system 
of  parameters,  we  have  found  a  series  of  empirical  formulas  for  calcu¬ 
lating  the  properties  of  the  hydrocarbons  [33,  3^]. 
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3.  =  e  =  evtekticheskiy  =  eutectic 

3fl  =  ed  -  evtektoidnyy  »eutectoid 

c  *  s  *  sinteticheskiy  =  synthetic 
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HI  ■  Sh  ■  shapka  ■  cap 
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AN  INFORMATION  SYSTEM  ON  THE 
MATHEMATICAL  THEORY  OF  EXPERIMENTATION 

I.  GENERAL  DESCRIPTION  OF  SYSTEM 

V.V.  Nalimov,  Yu.P.  Adler,  and  Yu.V.  Granovskiy 

The  tasks  of  an  Information  system.  An  information  system  is  being 
designed  for  collective  search  for  new  ideas  in  that  field  of  knowledge 
which  we  might  term  the  mathematical  theory  of  experimentation.  It  is 
assumed  that  new  ideas,  no  matter  where  they  are  stated:  in  studies  of 
a  mathematical  nature  or  in  experimental  investigations  -  will  aid  in 
the  development  of  new  trends  in  our  activity.  Here  we  have  in  mind  not 
a  simple  mechanical  borrowing  of  foreign  ideas,  but  their  reanalysis. 

It  is  assumed  that  the  information  system  facilitates  the  process  of 
generating  new  Ideas,  at  least  for  those  investigators  who  have  the 
capability  of  associating  a  large  number  of  varied  propositions. 

The  mathematical  theory  of  experimentation  is  based  primarily  on 
vhe  methods  of  mathematical  statistics.  But  it  seems  to  us  that  these 
two  divisions  of  knowledge  cannot  be  identified,  just  as,  let  us  say, 
we  cannot  identify  quantum  mechanics  with  those  mathematical  disci¬ 
plines  on  which  it  is  based.  In  the  mathematical  theory  of  experimenta¬ 
tion  there  appear  many  nonstatlstical  methods  of  study,  but  there  do 
not  appear  certain  deeply  theoretical  divisions  of  mathematical  statis¬ 
tics. 

This  statement  may  be  illustrated  by  a  few  examples.  One  of  the 
important  tasks  of  the  mathematical  theory  of  experimentation  is  find¬ 
ing  the  optimal  conditions  for  the  operation  of  technological  process- 
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es.  Here,  along  with  the  probability  approach  there  is  possible  a 


strictly  deterministic  approach  with  the  subsequent  use  of  variational 
analysis,  dynamic  programming,  and  the  Pontryagin  maximum  principle.  In 
some  publications  both  the  probability  and  the  variational  methods  of 
analysis  are  used  simultaneously.  There  may  also  be  publications  on  the 
mathematical  theory  of  experimentation  in  which  along  with  the  tradi¬ 
tional  statistical  methods  use  is  made  of  the  ideas  and  methods  of  in¬ 
formation  theory,  game  theory.  Finally,  the  mathematical  theory  of  ex¬ 
perimentation  is  essentially  related  with  the  numerical  methods  of 
analysis  and  with  modern  computer  technology.  Obviously,  the  designed 
information  system  must  be  sufficiently  selective  —  it  must  select  not 
all  works  on  the  divisions  of  knowledge  listed  above,  but  only  those 
which  have  direct  bearing  on  the  subject  being  considered. 

It  is  planned  to  construct  the  information  system  so  that  it  will 
permit  both  the  following  of  the  development  of  new  trends  in  the  cur¬ 
rent  publications,  and  also  carrying  out  a  retrospective  search  using 
the  publications  of  recent  years.  The  first  of  these  tasks  Is  the  re¬ 
sult  of  the  existing  system  for  organizing  library  activities.  As  a 
rule,  the  libraries  of  our  scientific  research  institutes  and  universi¬ 
ties  do  not  obtain  even  the  basic  foreign  statistical  journals.  The  in¬ 
vestigators  performing  creative  work  in  the  field  of  the  mathematical 
theory  of  experimentation  do  not  have  the  capability  of  immediate  anal¬ 
ysis  of  the  development  of  foreign  science.  The  information  system  ba- 
ing  designed  must  fill  this  gap.  The  second  task  formulated  above  is 
the  result  of  the  fact  that  the  single  existing  system  for  retrospec¬ 
tive  search  -  the  subject  index  of  the  RZh  Matematika  does  not  have  a 
sufficiently  deep  indexing  system.  Experience  shows  that  this  informa¬ 
tion  system  cannot  be  used  for  precise  formulation  of  search  problems. 

The  information  system  being  designed  must  encompass  both  mathe- 
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matical  works  of  a  methodological  nature,  and  publications  presenting 
the  experience  in  application  of  new  methods  of  investigation.  Here  ac¬ 
count  must  be  taken  of  the  special  place  that  mathematical  statistics 
occupies  among  the  other  sciences  [1],  On  the  one  hand,  this  is  un¬ 
doubtedly  a  mathematical  discipline  whose  study  requires  serious  mathe¬ 
matical  preparation  -  in  all  the  countries  of  the  world  it  is  taught  in 
the  mathematics  departments.  On  the  other  hand,  mathematical  statistics 
has  one  characteristic  which  is  not  typical  of  the  purely  mathematical 
disciplines.  In  contrast  with  the  purely  mathematical  disciplines,  here 
we  cannot  limit  ourselves  to  such  criteria  of  value  as  the  profoundness 
of  ideas,  refinement  of  construction,  and  logical  noncontradiction.  In 
evaluating  the  new  statistical  methods  of  analysis  we  must  first  of  all 
consider  their  practical  effectiveness.  Experience  shows  that  new  and 
very  fruitful  trends  in  mathematical  statistics  arise  after  we  are  able 
to  formulate  well  the  problem.  We  must  be  able  to  propose  a  formulation 
which  will  correspond  to  the  intuitive  understanding  of  the  experiment¬ 
ers.  The  mathematical  reanalysis  of  what  is  done  by  talented  experi¬ 
menters  will  aid  in  formulating  new  problems.  Hence  it  Is  clear  that  it 
is  very  important  to  follow  studies  published  in  nonmathematical  jour¬ 
nals. 

Search  system.  Considering  the  arguments  Just  stated  above,  we 
have  proposed  two  different  systems  for  information  search. 

1.  The  search  of  the  methodological  studies  will  be  performed  by 
direct  scanning  of  10-20  basic  statistical  Journals.  Gradually.,  as  the 
information  system  expands,  the  number  of  Journals  will  be  increased. 

We  must  be  aware  that  with  this  search  system  we  will  miss  all  those 
methodological  studies  which  are  published  in  nonmathematical  journals. 
Here  we  might  mention  that  the  International  Journal  of  Abstracts,  Sta¬ 
tistical  Theory,  and  Methods*  performs  a  systematic  search  of  the  meth- 
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odological  studies  In  417  Journals  such  as  Cancer  Bulletin,  Forest  Sci¬ 
ence,  Textile  Research,  Tabacco  Science,  etc.  Experience  shows  that 
methodological  articles  are  published  quite  frequently  in  250  special¬ 
ized  journals  of  this  type.  In  order  to  fill  the  gaps  which  are  un¬ 
avoidable  with  our  search  system,  we  must  enter  on  punched  cards  the 
abstracts  from  the  International  Journal  of  Abstracts,  from  the  Ameri¬ 
can  abstract  journal  Quality  Control  and  Applied  Statistics,*  and  from 
the  RZh  Matematika  (VINITI  AN  USSR).  All  these  materials  will  Inevit¬ 
ably  arrive  with  the  considerable  delay  which  is  characteristic  of  the 
abstract  journals.  Referral  to  the  abstract  journal  must  be  considered 
as  only  a  temporary  measure.  It  is  possible  that  in  the  future  we  will 
be  able  to  establish  a  working  arrangement  with  the  International  Ab¬ 
stract  Journal  In  order  to  exchange  results  of  the  publication  search 
before  the  corresponding  abstracts  are  written  and  published. 

2.  The  search  for  experimental  studies  in  which  use  is  made  of  the 
mathematical  theory  of  experimentation  must  be  carried  out  over  the  en¬ 
tire  array  of  journals  and  conference  publications  which  relate  to  ex¬ 
perimental  physics,  chemistry,  metallurgy,  geology,  biology,  agrobiol¬ 
ogy*  psychology,  sociology,  etc.  If  we  limit  ourselves  to  the  most  im¬ 
portant  Journals,  then  we  would  still  need  to  deal  with  more  than 
10,000  titles  of  various  issues.  In  order  to  make  this  task  at  least  to 
some  degree  feasible,  we  propose  to  conduct  the  publication  searcn  only 
with  respect  to  the  bibliographic  references.  From  the  set  of  publica¬ 
tions  being  considered  we  shall  select  a  subset  in  which  the  name  of  at 
least  one  of  the  scientists  who  have  made  a  contribution  to  the  mathe¬ 
matical  theory  of  experimentation  is  listed  in  the  bibliography.  The 
first  list  of  such  scientists  has  been  compiled.  In  this  "key"  list  are 
included  also  the  authors  of  well-known  monographs  and  certain  popular 
articles.  With  this  search  system  there  is  no  need  to  even  read  the  ti- 
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ties  of  the  articles.  The  search  Is  performed  mechanically,  by  un¬ 
skilled  personnel  or,  If  this  Is  possible,  by  a  computer.  The  technique 
presented  here  has  long  been  used  In  a  somewhat  different  form  by  ex¬ 
perienced  personnel  of  the  bibliographic  service.  When  they  need  to  ob¬ 
tain  some  idea  of  the  contents  of  a  publication  written  in  some  inac¬ 
cessible  language  (Japanese,  Hungarian),  they  usually  resort  to  an 
analysis  of  the  bibliography  presented  at  the  end  of  the  article. 

We  made  a  little  experiment  on  this  sort  of  information  search  in 
the  chemical  and  metallurgical  Journals.  The  search  is  accomplished  on 
the  average  at  a  rate  of  6  Journals  per  hour,  if  there  is  direct  access 
to  the  shelves  with  the  Journals.  After  5  hours  and  scanning  30  Jour¬ 
nals  we  had  found  18  articles  of  interest  to  us.  Further  experiments 
showed  that  we  miss  about  25%  of  the  publications  which  we  really  need. 
It  was  found  that  our  key  list  did  net  include  certain  authors  whose 
publications  are  "daughters”  with  respect  to  the  primary  authors.  It 
will  be  necessary  to  make  a  very  careful  study  of  the  additions  to  the 
key  list.  However,  the  missing  of  even  25%  of  the  publications  of  an 
experimental  nature  is  not  critical.  In  part  the  gaps  will  be  filled 
with  some  delay)  as  a  result  of  the  abstracts  taken  from  the  abstract 
Journals. 

The  search  system  using  the  bibliographic  citations  is  used  by  the 
American  firm  Institute  for  Scientific  Information,  Philadelphia,  USA. 
There  are  interesting  examples  in  the  information  list  of  this  firm. 

Two  of  them  relate  to  two  articles  of  the  well-known  statistician 
Tukey.  A  summary  is  presented  of  all  those  works  in  which  there  Is  a 
bibliographic  reference  to  these  articles.  This  summary  Is  undoubtedly 
of  great  interest.  By  scanning  these  publications  we  can  immediately 
obtain  an  Idea  of  where  the  ideas  presented  in  the  article  of  interest 
to  us  aroused  Interest.  A  detailed  analysis  is  presented  in  [2]  of  the 
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bibliographic  coupling  of  the  articles  of  the  Journal  Physical  Review. 

Information  carriers.  As  the  information  carrier  in  our  informa¬ 
tion  system  we  used  the  standard  K5  punched  card  of  1H7  x  207  mm  format 
with  two-row  edge  punching.  The  sorting  of  these  punched  cards  is  ac¬ 
complished  manually  or  with  the  aid  of  a  simple  electromechanical  vi¬ 
brator.  The  selection  of  the  edge-punched  cards  as  tne  information  car¬ 
rier  is  a  compromise  solution  which  permits  in  some  degree  satisfying 
both  of  the  requirements  formulated  above  -  following  the  current  pub¬ 
lications  and  retrospective  search.  Code  information  (notches  on  the 
card  edge)  will  serve  for  the  retrospective  search.  Information  pre¬ 
sented  in  ordinary  language  will  be  used  for  the  current  monitoring. 

The  bibliographic  description  of  the  publication  and  a  summary  or  brief 
abstract  are  printed  on  the  card.  A  microfilm  with  a  complete  text  of 
the  article  may  also  be  mounted  in  the  card  [3]. 

Filling  out  the  card.  Speed  -  this  is  one  of  the  basic  require¬ 
ments  imposed  on  the  information  system.  The  completed  punched  cards 
(with  mounted  microphotocopy)  must  arrive  at  the  personnel  interested 
in  them  within  a  few  days  after  the  appearance  of  the  journal.  In  order 
to  shorten  the  time  expended  in  preparation  of  the  card,  the  informa¬ 
tion  (title,  abstract,  or  summary)  must  be  reprinted  on  the  card  from 
the  journal  word-by-word  without  translation  info  Russian.  The  follow¬ 
ing  system  for  filling  out  the  card  is  suggested:  the  typist  reprints 
on  the  cards  word-for-word  the  information  and  transmits  them  together 
with  the  journal  to  the  abstracter  -  a  highly  qualified  specialist  on 
mathematical  statistics.  The  abstracter  scans  the  articles,  codes  their 
contents,  and,  if  necessary,  writes  a  very  short  summary  -  one  or  two 
lines.  In  contrast  with  abstracting,  the  coding  takes  very  little  time 
-  the  abstracter  should  not  study  carefully  the  contents  of  the  arti¬ 
cle,  it  is  sufficient  to  scan  the  article  and  gather  what  is  being 
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talked  about.  Our  operating  experience  shows  that  from  one  to  two  work¬ 
ing  days  are  required  for  coding  and  a  brief  summary  of  a  journal  of 
average  size. 

Key  words  and  the  coding  system.  Our  suggested  coding  system  is 
presented  in  the  second  part  of  this  article.  We  code:  1)  bibliographic 
information  (year  of  publication,  type  of  publication,  language,  first 
letter  of  surname  of  the  first  author);  2)  the  mathematical  concepts 
(key  words)  which  characterize  the  contents  of  the  article;  3)  field  of 
application  (for  works  of  an  experimental  nature).  The  contents  of  the 
publication  may  be  characterized  by  several  mathematical  concents, 
therefore  they  are  coded  using  a  direct  code.  The  fields  of  application 
are  coded  using  a  combined  code  —  here  it  is  assumed  that  the  publica¬ 
tions,  as  a  rule,  will  relate  only  to  some  one  applied  discipline. 

Only  140  mathematical  concepts  may  be  coded  in  our  system.  Experi¬ 
ence  shows  [4]  that  the  ratio  of  the  number  of  publications  to  the  num¬ 
ber  of  key  words  should  not  be  greater  than  two  orders.  It  appears  that 
our  information  system  will  reach  its  reasonable  upper  limit  after  5-7 
years.  Before  that  time  it  will  be  necessary  to  prepare  for  creating  a 
descriptor  information  system  with  a  large  number  of  key  concepts  (of 
the  order  of  1000).  As  the  information  carrier  in  such  systems  use  may 
be  made  of  the  so-called  superposition  punched  cards  [5].  We  hope  that 
specialized  information-logic  computers  will  be  ready  for  production  in 
the  course  of  the  next  5-10  years. 

It  is  Interesting  to  compare  our  system  of  key  mathematical  con¬ 
cepts  (see  the  second  part  of  this  article)  with  the  system  of  the  In¬ 
ternational  Journal  of  Statistical  Abstracts  Csee  Appendix  1).  Our  sys¬ 
tem  was  developed  independently.  In  spite  of  this,  there  i:*  no  great 
difference  between  the  systems.  The  classification  of  the  International 
Journal  Is  compiled  from  the  point  of  view  of  the  researchers  interest- 
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ed  primarily  in  the  development  of  mathematical  static  tics  as  an  inde¬ 
pendent  scientific  discipline.  The  key  concept  system  of  the  American 
journal  Quality  Control  and  Applied  Statistics  (see  Appendix  2)  differs 
significantly  both  from  our  system  and  from  the  system  of  the  Interna¬ 
tional  Journal  of  Statistical  Abstracts.  Here  there  is  clearly  the  re¬ 
sult  of  the  unique  formulation  of  the  problem  -  the  operation  is  con¬ 
sidered  primarily  from  the  point  of  view  of  the  statistician  -  the  spe¬ 
cialist  on  quality  control.  (Quality  control  is  an  engineering  special¬ 
ty  which  is  widely  used  abroad.  The  American  Society  of  Quality  Control 
has  16,000  members  and  has  its  own  periodic  publications,  etc.). 

Comparative  evaluation  of  the  abstract  journals.  It  is  interesting 
to  compare  the  Information  systems  in  existence  at  the  present  time 
from  the  point  of  view  of  the  problem  of  interest  to  us. 

A  portion  of  the  RZh  Matematika  is  the  section  Probability  Theory 
and  Mathematical  Statistics,  Theoretical  Cybernetics,  which  Is  al30  is¬ 
sued  in  the  form  of  a  separate  volume.  This  is  a  wide  ranging  journal. 
It  includes  the  following  sections. 

I.  Probability  theory  and  mathematical  etatistic8 . 

1.  Probability  theory. 

2.  Mathematical  statistics. 

3.  Application  of  probability-theoretic  and  statistical  methods. 

II.  Theoretical  cybernetics . 

1.  General  questions. 

2.  Control  system  theory  and  its  application. 

3.  Automatic  control  theory. 

*i.  Information  theory. 

5.  Operations  research  and  mathematical  economics. 

6.  Theory  of  mathematical  machines. 

7.  Mathematical  and  cybernetic  questions  of  biology  and  poycnolo- 
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gy- 

8.  Mathematical  questions  of  semiotics. 

III.  Author  index. 

In  sections  1.2  and  1.3  th$re  are  located  primarily  abstracts  on 
mathematical  works  of  a  methodological  nature.  The  methodological  works 
of  a  nonmathematical  nature  are  not  covered  completely.  The  selection 
of  such  works  is  somewhat  random.  The  abstracts  are  accompanied  by  code 
indices  using  the  UDK  (Universal  Decimal  Classification). 

The  International  Journal  of  Statistical  Abstracts*  is  a  narrowly 
specialized  Journal,  publishing  mathematical  studies  of  a  methodologi¬ 
cal  nature.  The  abstracts  are  arranged  in  the  journals  in  accordance 
with  the  adopted  system  cf  key  concepts.  Paper  of  different  color  is 
used  for  the  different  sections.  The  abstracts  are  short  (400  words  is 
recommended  by  "UNESCO").  They  are  printed  only  on  one  side  of  the 
page.  Each  abstract  is  accompanied  by  code  notation  which  characterizes 
the  contents  of  the  article.  The  journal  may  be  easily  unbound  and  re¬ 
bound  by  basic  sections.  We  mentioned  previously  that  417  journals  were 
examined  for  the  selection  of  the  publications  (in  1963).  These  jour¬ 
nals  are  distributed  by  countries  as  follows  (Jt):  USSR-1.44;  USA-29.98; 
Great  Britain-11.99;  India-8.87;  in  German-7.43;  Italy-7.19;  France- 
3.60;  Aust,'alia-3.12;  Japan-3.12;  other:  Netherlands-2.64;  Hungary- 
1.68;  Egypt-1.22;  Czechoslovak  SSR-0.72,  etc. 

The  American  abstract  Journal  Quality  Control  and  Applied  Statis¬ 
tics  is  also  of  a  narrowly  specialized  nature.  As  a  rule,  it  publishes 
long  abstracts  -  two  and  even  sometimes  three  pages  in  length.  It  is 
assumed  that  the  readers  using  such  extensive  abstracts  will  be  able 
to  obtain  a  clear  idea  of  the  paper  without  recourse  to  reading  the 
original  publication.  The  abstracts  are  presented  in  a  strictly  stand¬ 
ardized  form,  having  the  following  sections:  heading,  author,  Journal, 
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purpose  of  the  study,  summary,  results. 

Each  abstract  is  accompanied  by  code  notations  which  characterize 
the  contents  of  the  article.  (A  characteristic  of  the  Journal  is  speed. 
As  a  rule,  the  abstracts  appear  with  a  delay  of  less  than  six  months.) 

Comparing  these  three  Journals,  we  can  draw  the  following  conclu¬ 
sions. 

1.  In  the  RZh  Matematika  the  abstracts  on  works  of  a  theoretical 
nature  are  written,  as  a  rule,  on  a  high  level.  This  cannot  be  said  of 
certain  abstracts  written  on  the  statistical  studies  of  a  methodologi¬ 
cal  nature.  As  a  rule,  in  the  foreign  Journals  mentioned  above  the 
mathematical  studies  of  a  methodological  nature  are  abstracted  on  a 
higher  level.  Mathematical  statistics  has  today  become  a  broadly 
branched  discipline.  In  many  branches  of  this  science  we  do  not  have 
adequately  qualified  abstractors. 

2.  We  must  recognize  the  very  successful,  clear  arrangement  of  ma¬ 
terial  by  headings  used  in  the  International  Journal  of  Statistical  Ab¬ 
stracts.  It  Is  considerably  easier  to  follow  the  development  of  Indi¬ 
vidual  branches  of  mathematical  statistics  In  the  International  Journal 
than  in  the  RZh  Matematika. 

3.  The  specially  developed  titling  systems  of  the  foreign  journals 
are  considerably  more  convenient  than  that  used  in  the  UDK  system  which 
we  have  adopted.  We  get  the  impression  that  the  UDK  indices  In  applica¬ 
tion  to  works  on  mathematical  statistics  may  be  considered  as  noisy 
components  of  the  abstracts  rather  than  anything  else. 

4.  In  1963  the  following  number  of  abstracts  were  published  in  the 
three  journals  considered. 

RZh  Matematika.  Individual  issue  Probability  Theory 

and  Mathematical  Statistics,  Theoretical  Cybernetics. 

Sections  1.2  and  1.3  1273 
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International  Journal  of  Statistical  Abstracts, 

Methodological  Series  1000 

Quality  Control  and  Applied  Statistics  5^17 

Fewer  abstracts  are  published  in  the  foreign  abstract  Journals 
than  in  the  corresponding  sections  of  the  RZh  Matematika.  But  this  does 
not  mean  that  the  publications  of  the  foreign  Journals  may  be  consid¬ 
ered  as  subsets  of  the  publications  of  RZh  Matematika.  In  these  Jour¬ 
nals,  particularly  in  the  Journal  Quality  Control  and  Applied  Statis¬ 
tics,  there  are  often  included  abstracts  of  articles  from  publications 
which  VINITI  does  not  receive.  In  the  near  future  we  plan  to  make  a 
special  study  of  the  fields  where  the  three  abstract  Journals  do  not 
overlap . 

Analysis  of  growth  curves.  Using  the  information  system,  it  will 
be  possible  to  follow  the  development  of  individual  trends  of  the 
mathematical  theory  of  experimentation,  make  a  comparative  estimate  of 
the  development  of  mathematical  statistics  in  the  Soviet  Union,  make 
certain  predictions,  and  present  individual  recommendations. 

The  growth  curves  constructed  from  the  Individual  sections  of  the 
mathematical  theory  of  experimentation  may  be  of  considerable  interest. 
One  such  curve  is  shown  in  the  figure.  Here  the  years  are  plotted  along 
the  abscissa  axis,  the  ordinate  axis  is  the  total  number  of  methodolog¬ 
ical  works  relating  to  the  planning  of  screening  experiments  in  biol¬ 
ogy.  This  trend  of  mathematical  statistics  is  developing  in  connection 
with  the  problems  of  crop  husbandry,  animal  husbandry,  investigation  of 
pharmaceutical  preparations,  herbicides,  etc.  The  curve  was  plotted 
from  the  bibliography  of  [6].  The  curve  is  approximated  quite  well  by 
an  exponential.  A  sharp  deviation  from  the  exponential  growth  is  seen 
in  the  war  years,  which  is  quite  natural.  The  last  points  also  deviate 
somewhat  from  the  curve.  Two  hypotheses  may  be  suggested  to  explain 
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this  phenomenon:  the  rate  of  growth  is  diminishing  -  the  exponential  is 
transitioning  into  a  logistic  curve;  the  bibliography  has  not  been  ac¬ 
cumulated  sufficiently  completely  in  recent  years.  To  be  conservative, 
it  would  be  wiser  to  accept  the  second  hypothesis  in  the  present  case. 
Here  the  relative  rate  of  growth  is  12 . 95JK  -  This  corresponds  to  doub¬ 
ling  the  total  number  of  works  in  5.^  years.  A  similar  growth  curve  was 
constructed  for  the  methodological  works  on  factor  experiment  (in  the 
sense  in  which  this  is  understood  in  psychological  studies).  Here  we 
used  the  extensive  bibliography  presented  in  [7].  In  this  case  the  rel¬ 
ative  rate  of  growth  was  8.59?,  which  corresponds  to  doubling  the  num¬ 
ber  of  works  in  8  years.  Finally,  using  the  materials  of  our  bibliogra¬ 
phy  we  plotted  a  growth  curve  for  the  total  number  of  works  (methodo¬ 
logical  and  experimental)  on  the  planning  of  extremal  experiments.  Here 
the  relative  growth  rate  was  ^3.6?,  which  corresponds  to  doubling  in 
1.6  years.  It  is  pertinent  to  mention  that  the  growth  rate  of  the  num¬ 
ber  of  publications  in  the  major  divisions  of  science  such  as  physics, 
chemistry,  biology  varies  from  5  to  7%,  which  corresponds  to  doubling 
the  number  of  works  in  10-15  years  [8].  The  growth  rate  of  works  on 
planning  of  extremal  experiments  must  be  considered  to  be  very  high.  It 
is  interesting  to  compare  the  growth  curves  for  two  trends  in  statis¬ 
tics  —  planning  of  extremal  and  factor  experiments.  In  the  first  case 
the  total  number  of  works  of  a  methodological  nature  is  an  order  less, 
and  the  growth  rate  is  higher  by  nearly  a  factor  of  5.  This  is  ex¬ 
plained  by  the  fact  that  the  first  of  these  trends  began  to  develop  in 
1913*  the  second  started  only  in  1951.  Scanning  the  journals,  investi¬ 
gators  usually  Judge  the  development  of  new  trends  by  the  total  number 
of  publications,  and  noo  by  the  rate  of  growth.  It  is  clear  that  in 
this  case  many  rapidly  developing  tiends  may  remain  unnoticed. 

After  the  new  trends  in  the  mathematical  theory  of  experimentation 
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have  been  Identified,  it  is  natural  to  turn  to  the  second  step  -  com¬ 
pile  surveys  and  give  a  vigorously  Justified  evaluation  of  the  new 
trends.  In  certain  cases  monographs  will  be  written  as  the  next  step 
following  the  reviews. 

Using  the  information  system,  we  will  also  be  able  to  obtain  use¬ 
ful  information  on  the  preparation  and  retraining  of  personnel,  on  the 
activity  of  the  scientific  societies,  on  the  organization  of  scientific 
research  and  consultative  centers,  on  seminars,  conferences,  etc. 


so  so  00- 

t ,  toOb  2 


Growth  curve  of  number  of  works 
on  screening  experiment.  1)  To¬ 
tal  number  of  works;  2)  t ,  yrs. 

II.  KEY  WORDS  AND  THE  CODE  SYSTEM  IN  THE  PUNCHED  CARD  FILE  "MATHEMATI¬ 
CAL  THEORY  OF  EXPERIMENTATION" 

Yu. P.  Adler,  I.F.  Aleksandrova,  G.G.  Vorob'yev,  Yu.V.  Granovskiy,  Ye.V. 
Markova,  and  V.V.  Nalimov 

General  principles.  1.  The  punchcard  file  uses  the  "manual"  search 
methods  which  are  suitable  for  200-1000  searches  per  month  with  a  num¬ 
ber  of  punchcards  for  a  single  search  not  exceeding  10,000-40,000.  Con¬ 
sidering  the  growth  rate  of  the  number  of  works  in  this  field,  we  can 
expect  that  the  card  file  will  not  exceed  these  limitations  in  the 
course  of  the  next  10  years. 
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2.  The  coding  of  the  bibliographic  and  subject  information  is  ac¬ 
complished  by  cutting  openings  (edge  notching  or  connecting  openings 
with  one  another)  corresponding  to  the  given  characteristic. 

3.  The  punchcards  used  are  similar  to  those  of  the  RZh  Metallur- 
giya,  with  minimal  changes  in  the  format  which  are  noted  below.  All 
the  publications  are  classified  under  11  subject  headings,  each  of 
which  has  10  subheadings. 

it.  The  internal  field  of  the  card  is  divided  into  two  equal  parts. 
The  left  part  and  the  corresponding  part  on  the  reverse  side  are  in¬ 
tended  for  the  abstract  text.  The  right  side  remains  free.  The  produc¬ 
tion  of  transparent  photographic  microcards  (up  to  itO  pages),  which 
will  be  mounted  in  the  punchcard,  will  be  organized  in  the  future. 

Coding.  1.  The  search  labels  (key  words)  are  divided  into  two  cat¬ 
egories:  parallel  (for  example,  subject)  and  mutually  exclusive  (for 
example,  year  of  publication).  Depending  on  this  division  we  use  re¬ 
spectively  direct  and  combination  codes.  Each  label  is  coded  by  a  num¬ 
ber.  The  direct  code  places  in  correspondence  with  the  label  (number)  a 
single  notch,  while  the  combination  code  uses  a  combination  of  several 
notches.  In  a  given  card  file  the  direct  code  is  used  for  coding  the 
matnematical  concepts.  The  combination  code  is  used  for  coding  certain 
bibliographic  information  and  information  on  fields  of  application. 

2.  The  punchcard  adopted  permits  three  versions  of  standard  notch¬ 
es  : 

1)  a  shallow  notch  -  an  opening  of  the  outer  row  is  brought  out  to 
the  edge; 

2)  a  deep  notch  -  an  opening  of  the  inner  row  is  brought  out  to 
the  edge; 

3)  a  combining  notch  (slit)  connects  a  pair  of  openings;  one  in¬ 
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ner  and  one  outer. 


3.  For  the  direct  code  the  shallow  notches  correspond  to  the  la¬ 
bels  which  are  coded  by  the  openings  of  the  outer  row,  the  connecting 
openings  (slits)  correspond  to  the  labels  which  are  coded  by  the  open¬ 
ings  of  the  Inner  row,  and  the  deep  notches  correspond  simultaneously 
to  the  outer  and  Inner  openings . 

**•  When  searching  for  a  subheading  of  the  outer  row,  the  needle  Is 
inserted  In  the  outer  opening.  In  this  case  both  the  cards  having  only 
this  label  and  the  cards  containing  simultaneously  an  opening  of  the 
inner  row  fall  out. 

When  searching  for  a  subheading  of  the  inner  row  the  needle  is  in¬ 
serted  first  in  the  opening  of  the  inner  row.  After  separating  the 
cards  with  a  deep  notch  the  remaining  stack  of  cards  is  grasped  by  the 
left  hand  and  the  needle  which  is  removed  is  now  inserted  in  any  open¬ 
ing  of  the  inner  row  and  the  separated  cards  are  selected.  In  other 
cases  the  search  is  conducted  similarly. 

5.  Only  two  types  of  notches  are  used  for  the  combination  code: 
shallow  and  deep.  For  all  the  labels  on  the  upper  edge  of  the  card 
(other  than  the  first  letter  of  the  author  surname,  see  below)  we  use 
the  key  7-4-2-1,  which  permits  establishing  the  correspondence  between 
the  number  (code)  and  a  pair  of  notches. 

When  It  Is  necessary  to  code  100  labels  rather  than  10,  we  can  use 
two  fields  with  the  code  7-^-2-l.  In  this  case  ones  are  coded  in  one 
field,  and  tens  in  the  other. 

Filling  out  the  cards.  First  all  the  data  are  entered  on  a  model 
of  the  card  which  is  printed  on  cheap  newsprint.  The  code  system  is 
noted  on  the  model.  The  entry  of  data  on  the  model  is  accomplished  di¬ 
rectly  by  research  personnel.  The  data  are  transferred  from  the  model 
to  the  punchcard,  which  does  not  contain  the  expanded  code  system  -  the 
card  area  is  retained  for  mounting  a  miorofilm  of  the  publication. 
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Transfer  of  the  data  from  the  model  to  the  card  and  duplication  of  the 
cards  may  be  performed  by  technicians. 

The  system  "model  with  expanded  code  -  card  without  code"  is  con¬ 
venient  in  operation.  By  using  this  technique  we  can  significantly  sim¬ 
plify  the  most  tedious  process  -  the  coding,  without  encumbering  the 
basic  information  carrier,  the  card,  with  the  code  notations. 

Along  with  the  coded  information,  the  following  data  are  entered 
in  the  usual  fashion  on  the  card: 

1)  publication  data  of  the  article  (translation  of  the  title  of 
the  article  is  not  mandatory); 

2)  indication  of  the  abstract  journal  if  a  search  is  being  made  of 
the  journal; 

3)  indication  on  the  availability  of  a  photocopy  of  the  journal 
article  at  the  information  center; 

4)  indication  of  the  location  of  a  book  and  its  library  number. 

An  abstract  or  a  short  summary  is  also  printed  on  the  card. 

Use  of  the  card  file.  1.  It  is  advisable  that  the  cards  be  stored 
in  stacks  of  1000  units  in  drawers  with  dimensions  which  fit  the  cards 
precisely.  The  size  of  a  working  stack  is  200-250  cards. 

2.  When  processing  the  working  stack  it  is  oriented  with  the  oper¬ 
ating  edge  up  and  aligned.  Then,  grasping  the  stack  with  the  left  hand 
near  the  required  opening,  the  needle  is  introduced  with  the  right 
hand  (palm  up).  Then  the  left  hand  is  shifted  to  the  left  edge  of  the 
stack,  this  edge  is  brought  toward  the  operator  and  then  moved  away. 

This  spreads  out  the  cards  on  the  needle  and  permits  the  free  cards  to 
fall  out,  which  are  gathered  by  the  left  hand.  The  piles  of  cards  which 
have  fallen  free  and  those  which  remain  must  be  carefully  verified  to 
be  sure  that  there  are  no  cards  randomly  stuck  together  and  which  have 
not  fallen  out  and  also  to  verify  that  there  are  no  cards  which  have 
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fallen  out  because  of  edge  tears. 

3.  This  card  file  is  a  single  unit  which  is  assigned  the  code  1. 
With  further  development  of  the  card  file  we  plan  to  organize  several 
units  on  related  questions.  For  storage  each  unit  may  be  arranged  with 
respect  to  any  subject  tag  or  in  an  arbitrary  order. 

4.  The  search  plan  must  be  arranged  so  that  in  the  first  selection 
we  obtain  a  minimal  number  of  cards.  With  a  reduction  of  the  number  of 
cards  which  fall  out  to  20-30  it  is  advisable  to  change  over  to  the 
conventional  manual  sorting. 

Adding  to  and  correcting  the  system.  1.  The  search  system  provides 
the  possibility  of  supplementing  by  using  the  vacant  combinations  in 
the  code,  free  openings  in  the  fields,  and  free  fields  if  there  are 
any.  The  three  corner  openings  should  not  be  used,  since  they  enable 
the  extraction  from  the  card  file  of  accidentally  inverted  cards. 

2.  When  correcting  the  system  (for  example,  when  improving  the 
classification)  the  subject  headings  are  partially  or  completely  as¬ 
signed  new  values.  For  combined  storage  of  old  and  new  cards  a  signal 
notch  is  made  in  one  of  the  unused  openings  of  the  new  cards.  For  a 
slight  change  of  the  model  the  cards  with  obsolete  notches  are  removed 
from  the  card  file  and  the  unnecessary  notches  in  these  cards  may  be 
accurately  patched  or  new  notches  may  be  cut. 

Equipment .  1.  The  necessary  equipment  includes: 

1)  drawers  which  are  precisely  fitted  to  the  card  width  and  are 
designed  for  1000  cards; 

2)  a  tumbler-spitsa  with  one  pointed  end  and  a  handle  on  the  other 
end.  The  material  is  stainless  steel.  Dimensions  are:  length  of  working 
section  (without  handle)  200-300  mm,  diameter  of  working  section  1.5-2 
mm; 

3)  a  manual  punch  for  the  notches  (of  the  railroad  ticket  punch 
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type).  This  may  be  replaced  by  scissors  and  a  safety  razor  blade. 

2.  More  complex  equipment: 

1)  mechanical  punch  for  simultaneous  punching  of  one  opening  in  a 
stack  of  cards; 

2)  a  coding  machine  for  simultaneously  punching  all  the  necessary 
openings ; 

3)  needles  with  a  catch  (of  the  safety-pin  type); 

4)  a  clip  for  manual  sorting  using  several  openings  at  once. 

Other  equipment  may  also  be  used. 

3.  The  Microfot  reader  with  a  special  holder  is  used  for  reading 
the  aperture  punchcards  (with  microcopies  of  the  text  inserted). 

Code  system.  1.  The  index  for  the  first  letter  of  the  author  sur¬ 
name  is  similar  to  the  index  of  the  RZh  Metallurgiya  [9]. 

2.  Index  for  the  publication  type.  The  search  with  respect  to  type 
of  publication  may  be  performed  in  two  aspects:  with  respect  to  form 
(article,  book,  etc.)  and  with  respect  to  content  (theoretical,  ap¬ 
plied,  etc.). 

The  code  for  the  publication  type  with  respect  to  content  is  shown 
in  Table  1. 

The  code  for  the  publication  type  with  respect  to  form  is  shown  in 
Table  2,  The  key  4-2-1  is  used  (part  of  the  key  7-4-2-1).  It  permits 
establishing  the  correspondence  between  a  pair  of  openings  and  the  num¬ 
bers  from  1  to  6. 

3.  Language  index.  For  the  language  index  we  use  the  card  field 
which  in  the  standard  cards  of  the  RZh  Metallurgiya  is  denoted  by  the 
term  ’’branch.”  The  code  for  the  language  Index  is  shown  in  Table  3. 

4.  The  chronological  index  within  the  limits  of  one  decade  is  sim¬ 
ilar  to  the  index  of  the  RZh  Metallurgiya  of  VINITI  of  the  Academy  of 
Sciences  USSR  (Table  *<). 
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TABLE  1 

Publication  Type  by  Content 
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2  *■' 

Tun  twptu 
$ 

TwpeiHMCCKid  MITtM«THMC> 
CKIX  . 

1 

9 
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exm  . 

7  UWa 

1  npi:M«H«s . 

3 

4  * 

3  Odscp,  <SiiC.iHorpa$Hn . 

7  me.il. 

notches*  1  <*  «< thodologlcal  nature  Is  coded  by  two  shallow 

a  lorg  blbliosraDhvthltPis1in  notAsur,ey  paper,  but  contains 

8  D1D-L1°srapny ,  it  is  in  addition  coded  with  a  notch  4  (slit). 

notch  type;  4)  theoretical  mathematics*  5)  shal- 
MMiograph?  physico-chemical ;  7)  slit;  8)  applied;  9)  review. 


TABLE  2 

Publication  Type  by  Form 


tlpiMait 

l 

Km 

2 
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OTtepCTH* 

Tun  »Mpeia 

If 

5  Crime . 

i 

9 

TjiyteKKft  S 

7  Kimri,  uoHorpajiHa,  TpyAu  kox- 
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8  Aaccepnip* . 

* 

i+2 

• 

Mc/ikhA  9 
rjiytfoKHft  6 

MmkhA  9 
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l  0  Oner  . . 

1  1 

1  2  S[0MPb'  noco®Me  00  nepcaofly 

5 

A 

H4 

2+4 

co$?; .3)  comhination  of  openings;  4)  notch  type;  5)  ar 
ffli-iA  book,  monograph,  conference  transactions-  8)  disser- 

tendtoik9’  ShaU°Ki  10>  reP°rti  U)  translation  aid"!*" 
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TABLE  3 


Language  Index 


r  fan 

1 
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If 
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7PyccK*l  .  .  .  .  . 

an 

raytfoKHft  * 

JHfncmmft 

*  .  . 

1  1  Jtaoncicaft  ... 

1  fflpoaie 

• 

} 

5 

e 

7 

8 

8 

0 

1+2 

1+41  2 
AMtootprt 

2+7  F* 

1+74J**ep*l 

* 

MWKICA  9 
r^yCoKu*  6 
MtAKNl  9 

.  » 

» 

» 

» 

» 

ll8htnf i^deep ; J 7 ) °Russian ; °8 ) ^German • °9 ) Shallow ^  ?nl°S  type:  5>  Eng* 
anesej  12)  reserve;  13)  other.  '  9  shallowi  10>  French;  11)  jap- 


TABLE  i| 


Chronological  Index 


r«w 

i 

L2— M - 

Tan  II 
aupcia  Toa 

**  II  1 

|  Ko* 

CoerraHwc 

eraepctMft 

Net 

■Mpta* 

liiil 

1 

2 

3 

4 

5 

1+2 

1+4 

S 

TayfioKNA 
„  »  t 

MejixxA  j 
rayfioxHA 
MenauA* 

Psgii 

6 

7 

8 

9 

0 

2+4 

1+7 

2+7 

4+7 

MmkkB 
rnyOoKHftS 
McjikmA  6 

» 

» 
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For  coding  Publication,  from  1951  to  l960  we  use  the  cane  system 

°  bUt  ^  tMS  *"  «««•»  «>.  closest  upper  bole  to  the 

.1-  on  the  right  1.  punched.  The  publication,  prior  to  1951  are  coded 
by  a  slit  alongside  the  field. 

5.  The  subject  Index  1,  shown  In  Table  5.  It  consists  or  11  major 
eullngs  1.0,  1.1 . 2.0,  each  of  which  Is  divided  Into  subheadings 

Which  are  certain  concept,  which  are  elects  (words,  of  the  system  In- 
formation  language. 

On  the  card  there  remain,  room  for  two  decimal  headings  n 
These  heading,  remain  In  reserve  -  they  win  be  filled  ^mdex- 
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ing  system  is  expanded. 

Fields  of  application.  In  the  great  majority  of  cases  each  of  the 
publications  may  relate  only  to  some  general  field  of  application. 
Therefore  for  thio  division  we  use  the  combination  code  which  has  par¬ 
ticularly  high  capacity:  100  concepts  may  be  arranged  on  60  holes.  For 
the  section  "fields  of  application"  we  use  that  part  of  the  field  which 
in  the  standard  cards  of  the  RZh  Metallurgiya  has  the  label  "country." 
The  right  side  of  this  field  is  used  for  the  numerical  code  0-9 ,  the 
left  side  is  used  for  denoting  the  first  digit  of  a  binary  number.  Tne 
code  is  shown  in  Table  6. 

Examples.  Number  26  denotes  "Study  of  particles,  grains,  etc."  The 
digital  and  literal  codes  indicate  that  in  the  tens  field  there  will  be 
punched  two  shallow  notches  2+1,  and  in  the  ones  field  there  will  al¬ 
so  be  two  shallow  notches  2  +  ^.  Ths  number  30  denotes  "Medicine."  In 
this  case  in  the  tens  field  there  is  punched  a  single  deep  notch  kt 
while  in  the  ones  field  there  are  two  shallow  notches  4  +  7 . 
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Sciences  All-Union  Institute  of  Scientific  and  Technical 
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163  M  =  M  =  melkiy  =  shallow 
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164  rOCTbi  =  GOSTy  =  Gosudarstvennyye  Obshchesoyuznyye  Standarty 

=  All-Union  State  Standards 

167  T  =  G  =  glubokiy  =  deep 
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Appendix  1 

CLASSIFICATION  OF  STATISTICAL  METHODS  IN  THE 
INTERNATIONAL  JOURNAL  OF  STATISTICAL  ABSTRACTS 


0.  Mathematical  methods 
0.  General  articles 

1.  Solution  of  equations 

2.  Methods  of  constructing 
curves  from  points 

3.  Interpolation  and  quadrature 

Special  functions  and  their 
transformations 

5.  Functional  relations 

6.  Determinants  and  matrices 

7.  Game  theory 

8.  Programming  techniques 

9.  Reserve 

1.  Probability  theory 
C.  General  articles 

1.  Probability  calculation 

2.  Parameters  of  a  general  en¬ 
semble  (individual  estimates) 

3.  Problems  of  combinatorics 

4.  Geometrical  probability 

5.  Limit  theorems 

6.  Convergence  in  probability 

7.  Stochastic  approximation 


8.  Theory  of  decision  making  and 
preference  function 

9.  Reserve 

2 .  Distribution 

0.  General  articles 

1.  Descriptive  properties 

2.  Transformations  of  variables 

3.  Normal  and  lognormal 

4.  Binomial,  multivariate,  normal, 
and  hypergeometric 

5.  Poisson,  exponential,  negative 
binomial,  logarithmic,  and  the 
distribution  which  depends  on 
random  parameters 

6.  Rectangular  and  extremal  value 

7.  Pearson  and  "time  series" 

8.  Truncated  and  mixed 

9t  Other  distributions 

3.  Sampling  distributions 

0.  General  articles 

1.  t,  a,  F,  and  x2  distributions 

2.  Noncentral  distributions 

3.  "Studentization";  normal  approxi 
mations 

4.  Quadratic  forms 
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5.  Correlation  and  regression 
coefficients 

6.  Ordering  and  statistical 
scales 

7.  Descriptive  statistics 

8.  Order  statistics 

9.  Multifactor  problems 
Estimates 

0.  General  articles 

1.  Properties  of  estimates 

2.  Types  of  estimates:  fundamen¬ 
tal  theory 

3.  Individual  estimates:  the 
point 

4.  Same:  the  interval 

5.  Inequalities;  admissible 
limits  and  regions 

6.  Methods  which  do  not  use  dis¬ 
tribution  functions 

7.  Sequential  methods 

8.  Multifactor  problems 

9.  Finite  ensembles  -  prelimi¬ 
nary  investigations 

5.  Hypothesis  testing 

0.  General  articles 

1.  Criteria  properties 

2.  Single  hypotheses 

3.  The  dual  campling  problem 

U.  The  Jc-sampling  problem 

5.  Sharply  differing  values 

6.  Criteria  which  do  not  use 
distributions 

7.  Sequential  criteria 


8.  Multifactor  problems 

9.  Reserve 

6.  Equations 

0.  General  articles 

1.  Regression  -  the  linear  hypothe¬ 
sis 

2.  Correlation 

3.  Factor  methods 

4.  Other  multifactor  methods 

5.  Ranking  methods 

6.  Systems  of  equations;  structure 

7.  Nonlinear  equations  —  the  logis¬ 
tic  curve 

8.  Equation  transformation  -  start¬ 
ing  with  two  possibilities 

9.  Correspondence  and  conjugacy 

7 .  Dispersion  analysis 

0.  General  articles 

1.  Model  with  constant  factors 

2.  Model  with  random  factors 

3.  Mixed  and  other  models 

4.  Nonorthogonal  observations  and 
missing  values 

5.  Nonstandard  conditions  (violation 
of  premises 

6.  Covariational  analysis 

7.  Multiple  comparisons;  processes 
of  multiple  decision  making 

8.  Data  ranking 

9.  Sequential  methods 
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8.  Sample  planning  (sampling 
plans) 

0.  General  articles 

1.  Random  samples;  stratified 
samples,  multi-stage  samples 

2.  Samples  with  unequal  proba¬ 
bilities 

3.  Multiple  samples;  dual  sam¬ 
ples 

4.  Natural  populations 

5.  Nonsampling  problems 

6.  Censored  samples,  systematic 
samples,  and  group  samples 

7.  Nature  and  number  of  objects; 
cost  and  effectiveness 

8.  Acceptance  inspection 

9.  Process  control 

9.  Experiment  planning 

0.  General  articles 

1.  Planning  with  partitioning 
into  blocks,  complete  and 
incomplete 

2.  Factor  plans 

3.  Response  surfaces 

4.  Nature  of  objects;  number  of 
replicas;  cost  and  effective¬ 
ness 

5.  Paired  comparisons  and  com¬ 
parison  problems 

6.  Preference  criteria 

7.  Repetition  of  experiments; 
other  planning 

8.  The  weighting  problem 

9.  Sensitivity  problems 


10.  Theory  of  random  processes  and 
analysis  of  time  series' 

0.  General  articles 

1.  Properties  of  individual  process¬ 
es 

2.  Estimate  problems 

3.  Hypothesis  verification 

4.  Queues  and  weighting-line  theory 

5.  Information  theory 

6.  Spectral  analysis 

7.  Auto-  and  serial  correlation 

8.  Multivariate  processes 

9.  Investigations  in  biology,  popu¬ 
lations 

11.  Miscellaneous 

0.  General  statistical  methodology 

1.  Statistical  tables  and  diagrams 

2.  Probability  paper 

3.  Nomography  and  graphical  methods 

4.  Machine  methods:  manual  and  with 
the  use  of  punchcards 

5.  Machine  methods:  electronic  com¬ 
puters 

6.  Machine  methods:  cthe. 

7.  Monte  Carlo  methods 

8.  Indexing 

9.  History,  biography,  and  bibliog¬ 
raphy 
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Appendix  2 

CLASSIFICATION  IN  THE  AMERICAN  ABSTRACT  JOURNAL 
"QUALITY  CONTROL  AND  APPLIED  STATISTICS" 

100:  Statistical  industrial  control 
110:  Cards  for  quality  control 

111:  Control  cards  for  factors 
112:  Control  cards  for  properties 
120:  Specifications,  tolerances,  and  process  capabilities 
130 :  Distribution  functions  in  process  control 
190 :  Miscellaneous 

200:  Sample  types  and  sampling  plans 
210:  Sample  types 

211:  Mass  sampling 
212:  Discrete  unit  sampling 
219:  Other 
220:  Sampling  plans 

221:  Selecting  the  correct  sampling  plans 

222:  Plans  for  control  of  properties  (Attribute?) 

223:  Plans  for  control  of  factors  (Variables) 

230 :  Census  and  survey 
290 :  Miscellaneous 

300:  Control  of  quality  control 

310:  Formulating  the  quality  control  task 
320:  Education  in  quality  control 
330:  Organization  of  quality  control 
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331:  Personnel 

340:  Quality  control  administrative  engineering 
341:  Recording  and  reports  on  quality 
342:  Standards  and  methods 
343*.  Incentive  plans 
349:  Other  administrative  methods 
350 :  Economics  of  quality 

351:  User-vendor  relationship 
353:  Quality  standards 
359:  Miscellaneous 
390:  Miscellaneous 

400:  Mathematical  statistics  and  probability  theory 
410:  Estimate  theory  and  statistical  conclusions 
420:  Properties  of  distribution  functions 
421:  Normal  distribution 
422:  Poisson  distribution 
423:  Binomial  distribution 
424:  Mixed  distribution 
425:  Distribution  of  extremal  values 
429:  Other  distributions 
430 :  Probability  theory 
490 :  Miscellaneous 

500 :  Experiments  and  the  study  of  relationships 

510:  Significance  criteria  and  confidence  intervals 
520 :  Planning  and  analysis  of  experiments 
521:  Experiment  organization 
522:  Experiment  planning 

522. 1:  Complete  blocks  (factor  plans,  Latin  squares, 
randomized  blocks,  etc.) 
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522.2:  Incomplete  blocks  (Oudln  squares,  etc.) 
522.3:  Sequential  plans 
522.4:  Fractional  plans 
522.5:  Response  surfaces 
522.9:  Other 

523:  Special  methods  of  data  analysis 
523. 1:  Data  conversion 
523-2:  Missing  values 

523-3:  Prominent  values  and  data  processing 
523.4:  Dispersion  analysis  of  ranked  data 
523.5:  Table  of  characteristic  conjugacy 
523.9:  Other 
529:  Miscellaneous 

530:  Correlation 

531:  Pairwise  correlation 
532:  Multiple  correlation 
533:  Rank  correlation 
534:  Covariational  analysis 
539:  Other 

540:  Construction  of  empirical  curves 
541:  Linear  regression 
542:  Nonlinear  regression 
543:  Multiple  regression 
544:  Orthogonal  polynomials 
545:  Time  series 
546:  Other 

550:  Simplified  analysis  methods 
551:  Nonparametric  tests 
552:  Graphical  analysis  of  data 
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553:  Other 
590:  Miscellaneous 
600:  Application  to  equation .* 

610:  Operations  research 
6ll:  Data  collection 

612:  Special  methods  and  their  application 

612.1:  Programming 

612.11:  Transport  problems 

612.12:  The  assignment  problem 

612.13:  The  sequence  problem 

612. 14:  The  traveling  salesman  problem 

612.15:  The  porfolio  problem 

612.2:  Information  theory 

612.3:  Game  theory 

612. 4:  Queueing  theory 

612.5:  Symbolic  logic 

612.6:  Dynamic  programming 

612.7:  Other  mathematical  methods 

612.8:  Modeling  processes 

612.9:  Other 

613:  System  analysis 

620:  Engineering  industrial  method 

622:  Compilation  of  graphs  and  control  (production,  inven¬ 
tory,  etc.) 

623:  Work  measurement  and  wage  planning 
625:  Equipment  replacement 
629:  Other 

630 :  Commercial  methods 

631 :  Measurement  and  analysis 


* 


J 


V 
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632:  Prediction 
633:  Finance  policies 
634:  Sales  control 
639:  Other 
690:  Miscellaneous 
700:  Measurement  and  control 

710:  Measurement  of  quality  characteristics 
711:  Physical  properties 
711.1:  Length,  volume 
711.2:  Mass,  density 

711.3:  Capillary,  colloidal,  and  surface  properties 

711.4:  Mechanical  properties,  including  rheological, 
ultimate  strength,  pressure 

711.5:  Electrical  properties,  including  resistance,  di 
electric  and  magnetic  properties 

711.6:  Optical  properties,  including  external  appear¬ 
ance,  transparency,  color,  absorption  spectro¬ 
photometry 

711.7:  Thermal  properties,  including  temperature,  re¬ 
sistance,  expansion,  heat  capacity 

711.9:  Miscellaneous 

712:  Dynamic  properties 

712. 1:  Velocity,  acceleration 

712.2:  Material  flux,  its  calculation 

712.9:  Miscellaneous 

713:  Structural  properties 

713. 1:  Molecular  structures 

713*2:  Internal  structures 

713.9:  Miscellaneous 

714:  Chemical  properties 

714.1:  Chemical  composition 
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714.2:  Chemical  reactions 
714.9:  Miscellaneous 


715:  Atomic  and  nuclear  properties 
716:  Aging  properties  and  wear 
717:  Subjective  characteristics 
720:  Process  control  and  automation  « 

730 :  Computers  and  special  devices  for  applied  statistics 
800 :  Reliability  of  complex  ensembles 

810 :  Control  of  the  reliability  functions 
811 :  Organization 
812:  Training 

813:  Data  collection  and  reporting 
8l4;  Reliability  standards 
815:  Reliability  specifications 
816:  Vendor-buyer  relationship 
819:  Miscellaneous 

820 :  Mathematical  theory  of  reliability 
821:  Models  of  probability  chains 
822:  Distribution  functions 
823:  Sampling  plans  for  reliability  estimate 
824:  Experiment  planning  and  reliability 
829:  Miscellaneous 
830 :  Aspects  of  system  reliability 

831 :  Systems  with  objective  reliability 

832:  Human  engineering  aspect 

833:  Selection  of  components 

834:  Gain  with  redundancy 

835:  Suitability  of  predictions 

839:  Miscellaneous 


840:  Reliability  improvement 
841:  Materials 
842:  Components 
843:  Ensembles 
844:  Systems 

848:  Tests,  means,  and  development 
849:  Miscellaneous 

850:  The  field  of  reliability  and  operation 
890:  Miscellaneous 
:A  Industry  and  production 
:Aa  Mechanical 

:Ab  Electrical  and  electronic 

:Ac  Chemical,  petroleum,  and  pharmaceutical 

:Ad  Mining,  ore-processing,  and  casting 

:Ae  Construction  work 

:Af  Construction  timber,  wood  and  paper 

:Ag  Plastics  and  rubber 

:Ah  Leather 

:Ai  Textile 

:Aj  Ceramics  and  glass 
:Ak  Graphics 

:A1  Foodstuffs  and  beverages 
:Am  Packing  industry 
:An  Rockets  and  spacecraft 
:Az  Miscellaneous 
:B  Trade  and  incentives 
:C  Banking,  finance,  and  insurance 
:D  Research  and  development 
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:Da 

:  Same  as  for  :A 

:Dz 

:E  Military,  naval,  and  government  operations 
:F  Social  sciences,  including  psychology  and  education 
:Q  Business  correspondence 
:H  Biometrics  (agronomy,  genetics,  etc.) 

:I  Medicine  and  related  questions 
:J  Utilities  and  transport 

:K  Earch  sciences  (geophysics,  meteorology,  oceanography,  etc.) 

:L  General  questions  of  administration 

:M  Service  organization  (garages,  restaurants,  etc.) 

:N  Econometrics 
:Y  General  applications 

:Ya  Estimate  of  correctness  and  accuracy 
:Yb  Testing  for  lifetime 
:Yc  Subjective  criteria 
:Z  Other 
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176  Actually,  for  the  publications  appearing  in  June  of  I960  the 

abstracts  on  operations  research  are  included  in  QCAS  only  to 
the  extent  to  which  the  original  works  were  concerned  direct¬ 
ly  with  quality  control  or  reliability.  For  complete  coverage 
of  the  literature  on  operations  research  the  subscribers  are 
referred  to  the  service  of  Executive  Sciences  Institute, 
OR/MS. 
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Appendix  3 

CLASSIFICATION  SCHEME  FOR  BIBLIOGRAPHY  OF  THE 
INTERNATIONAL  STATISTICAL  INSTITUTE 

0.  General  1.11  Miscellaneous,  includ¬ 

ing  statistical  tables 

0.1  New  periodic  publications  and  nomograms 


0.2  Conference  transactions, 
me’wri&l  editions 

0.3  Bibliography 

0.4  Dictionaries,  glossaries 

0.5  History  of  statistics  and 
its  bibliography 

0.6  Teaching  of  statistics 

0.9  Miscellaneous 

1.  Statistical  theory 

1.0  General 

1.1  Probability 

1.2  Distribution  functions 

1.3  Sampling  distributions 

1.4  Estimates 

1.5  Hypothesis  testing 

1.6  Equations 

1.7  Dispersion  analysis 

1.8  Sampling  plans 

1.9  Experiment  planning 

1.10  Theory  of  random  proc¬ 
esses  and  analysis  of 
time  series 


2.  Applications  of  statistics 

2.0  General 

2.1  Economics 

2.2  Sociology  and  psychology 

2.3  Demography  and  the  sci¬ 
ence  of  insurance 

2.4  Biology  and  medicine 

2.5  Physical  sciences  and 
technology 

2.6  Operations  analysis  and 
the  science  of  control 

2.9  Miscellaneous 

3.  Systematization  of  data 

3.0  General 

3.1  Legislation  and  organi¬ 
zation 

3.2  Problems  of  definition 
and  classification 

3.3  Statistics  of  basic  ma¬ 
terials 

3.4  Practical  sampling  prob¬ 
lems 

3*5  Process  data 
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3.6  Data  representation 
3.9  Miscellaneous 
4.  Statistical  data 
4.0  General 

4.1  Populations  and  public  * 

health 

4.2  Social  statistics 

4.3  Statistics  of  the  cul¬ 
tural  life  and  criminal 
statistics 

4.4  Agrarian  statistics 

4.5  Industrial  statistics 

4.6  Trade  and  transport  sta¬ 
tistics 

4.7  Financial  statistics 

4.8  Government  accounting 

4.9  Miscellaneous 

A.  Experiment 

1.  Experiment  planning 

2.  Experiment  conduct 

3.  Analysis  of  results 

B.  Preliminary  inspection 

1.  Sampling  plans 

2.  Systemization  of  data 

# 

3.  Publication  of  data 

4.  Analysis  of  results 

v 
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